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10 Background of the Invention 

Fidd of the Invention 

The inveation relates to the areas of microbial genetics and recombinant 
DNA technology. The invention provides gene sequences, vectors, 
microotganisms, promoters and regulatory proteins useful for the production of 
1 5 L-lysine. The invention further provides a method to increase the production of 

Uysine. 

RelatedAri 

L-lysine is an important economic product obtained principally by 

industrial-scale fermentation utilizing the Gram positive Corynebacterium 

20 gltiiamicum, Brevibacterium flavum and Brevibacterium lactofermentum 

(Kleemann, A., et. al., Amino Adds, in Ullmann'S ENCYCLOPEDIA OF 

IndustrialChemistry, vol. A2,pp.57-97, Weinham: VCH-Verlagsgesellschaft 
(1985)), 

The stereospecificity of the amino acids produced by fermentation makes 
25 the process advantageous compared with synthetic processes; generally L-foim 
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amino acids are produced by the microbial fermeatation process. Theproduction 
of L-lysine and other amino adds through femientation, utilizing che^ carbon 
sources such as molasses, glucose, acetic acid and ethanol, is a relatively 
inexpensive means of production. 

Microorganisms employed in microbial processes for amino add 
production be divided into 4 classes: wild-type strain, auxotn>phic mutant, 
rpgulatoiy mutant and auxotrophic regulatory mutant (K. Nakayama et ai, in 
NUTRmONAL IMPROVEMEOT OF FOOD AND FEED PROTEINS, M. Friedman, ed., 
(1978), pp. 649-661). 

Several fermentati(n processes utilizing various strains isolated for 
auxotrophic or resistance properties are known in the art for the production of 
L-lysine: U.S. Patent No. 2,979,439 discloses mutants requiring amino acid 
siq)planaitation (homosoine, or L-methionine and L- threonine); U.S. Patent 
No. 3,700,557 discloses mutants having a nutritional requirement L- 
Ihreonine, L-methionine, L-aiginine, L4iistidine, L-leucine, L-isoleudne, L- 
phenylalanine, L-cystine, or Iw^steine; U.S. Patent No. 3,707,441 discloses a 
mutant having a resistance to an L-lysine analog; U.S. Patent No. 3,687,810 
discloses a mutant having both an abiUty to produce L-lysine and a resistance to 
bacitracin, penidUin G or polymyxin; US. Patent No. 3,708,395 discloses 
mutants having a nutritional requirement for homoserine, L-threonine, L- 
threonine and L-methionine, L-leucine, L-isoleucine or mixtures thereof and a 
resistance to L-lysine, L-threonine, L-isoleucine or analogs fliereof; U.S. Patent 
No. 3,825,472 discloses a mutant having a resistance to an L-lysine analog; U.S. 
Patent No. 4, 169,763 discloses mutant strains of Corynebacterium that produce 
L-lysine and are resistant to at least one of aspartic analogs and sulfe dmgs; U.S. 
Palait No. 5,846,790 discloses a mutant stoain able to produce L-glutamic acid 
and L-lysine in Ae absence of any biotin action-suppressing agent; and U.S. 
Patent No. 5,650,304 discloses a strain belonging to the genus Corynebacterium 
or Brevibacterium for the production of L-Iysine that is resistant to 

4-N-(D-alanyl)-2,4-diainino-2,4-dideoxy-L-arabinose2,4-dideoxy-L-arabinose 
or a derivative thereof. 



PCTAJS0OaS617 

WO 01/49854 



-3- 



10 



AconsiderableamountisknownreganfiDgthebi^ 
lAysmcsyn^^inCoo^ebacteriumsp^ts(xo^^^ 
AfuuK Y.Acad. Set 782: 25-39(1996)). EnttyintotheL-lysmepafliway begins 
wiA L-aspartate (see Figure 1), which itself is produced by transamination of 
oxaloacetate. A spedai feature of C g/titomfcam is its ability to convert the 
lysine intermediate piperidine 2,6-dicarboxylate to diaminopimelate by two 
different routes, i.e. by reactions involving succinylated intermediates or by the 
singlereactionof diaminopimelatedehydrogenase. Overall, carbon flux intothe 
pathway isregulatedattwopoints: first,through feedback inhibition of aspartate 
kinase by the levels of both L-threonine and L-lysine; and second through the. 
conttol of the level of dihydrodipicolinate synthase. Hierefore, increased 
production of L-lysine may be obtained in Corynebacteritm species by 

deregulating and increasing the activity of these two enzymes. 

MorciecentdevdopmentsintheareaofL-lysinefeimentativeproduction 

1 5 in Corynebacterium species involve the use of molecular biology techniques to 

augment L-lysine production. Hie foUowing examples are provided as being 
exemplary of the art U. S. Patent Nos. 4,560.654 and 5,236,831 disclose an 
L-lysine producing mutant strain obtained by transforming a host 
Cbowebac/mw« or i?rmbacfen«m spedes microorganism 

20 S-(2-aminoethyl>K:ysteine widi a recombinant DNA molecule wherein a DNA 

fiagment conferring both resistance to S-(2-aminoethyl)-cysteine and L-lysine 
producing abiUty is inserted into a vector DNA; U. S. Patent No. 5,766.925 
discloses a mutant strain produced by integrating a gene coding for 
aspartokinase, originating fromcoryneformbacteria,withdesensitizedfeedback 

25 inhibition by L-lysine and L-tiireonine, into chromosomal DNA of a 

Corynebacterium species bacterium hartx)ring leaky type homoserine 
dehydrogenase or 2. CorynAacterium species deficient in homoserine 
dehydrogenase gene; increased L-lysine production is obtained by gene 
amplification by way of a plasmid vector or utilizing a gene replacement 

30 strategy. European PatentApplicationsEP 0 811 682 A2andEP 0 854 189 A2 
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bothprovideforincreasedproductioiiofL-lysinem Corynebacterium spcdcshy 
way of gene amplification based on plasmid copy number. 

Summary of the Invention 

It is an object of the invention to provide a method to increase the 
production of an amino add in Corynebacterium species by amplifying, i.e., 
increasing, thenumberofagene or genesofan amino acidbiosyntheticpalhway 

in a host ccB. Particularfy preferred Corynebacterium species include 
Corynd>acteritm glutamicum, Brevibacterium flavum, and Brevibacteritm 
lactofermentum. 

It is an object of the invention to provide an isolated feed back resistant 
aspartoldnase enzyme wherein the naturally occurring threonine amino add 
residue 380 in the feedbadc sensitive form is dianged to isoleudne in the ask 
gene of ATCC 21529. It is an object of the invention to provide an isolated ask 
polypeptide comprisingthe amino addsequenceofSEQIDNO:2. Itis another 

object of tiie invention to provide an isolated polynucleotide molecule 
comprisinganucleotide sequence encodingthepolypeptidesequenceofSEQ ID 

N0:2. It is another object oftheinventiottto provide an isolated polynucleotide 
molecule comprising a nucldc add having the sequence of SEQ ID NO: 1 . 

It is anoflier object of the invention to provide a method comprising 
transformingaCoo«e6acferittwspedeshostcellwthapolynucleotidemole^^^ 
comprisinganucleotidesequenceencodingapolypeptidecomprisingaminoadd 

SEQ ID N0:2, wherein said isolated polynucleotide molecule is integrated into 
saidhost cell's chromosome thereby increasing the total number of said amino 
add biosyndietic pathway genes in said host cell chromosome, and selecting a 
transformedhostcell. Itis afarther object ofthe invention to provideamethod 
comprising screening for increased amino add production. The method may 
further comprise growing said transformed host cell in a medium and purifying 
an amino acid produced by said transformed host cell. 
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In another embodiment, a method to increase the production of an amino 
acid is a method conqnising transforaiing a Corynebact&ium species host cell 
with anisolatednucleic acidmolecule mcoding the amino addsequence of SEQ 
ID N0:2, wherein said isolated nucleic acidmolecule is integrated into saidhost 
cell's chromosome thereby increasing the total number of said amino add 
biosyntihietic pathway genes in said host cell chromosome, and wherein said 
isolated nucldc add molecule further comprises at least one of the following: a 
polynucleotide encoding a Corynebacteritm spedes lysine pathway asd amino 
add sequence; a polynucleotide encoding a Corynebacterium spedes lysine 
pathway dapA amino add sequence^ a polynucleotide encoding a 
Corynebacterium spedes lysine pathway dapB amino add sequence; a 
polynucleotide encoding a Corynebacterium species lysine pathway ddh amino 
add sequence; a polynucleotide encoding a Corynebacterium spedes lysme 
pathway UysA amino add sequence a polynucleotide encoding a 
Corynebacterium spedes lysine pathway lysA amino arid sequmce; a 
polynucleotide encoding a Corynebacterium spedes lysine pathway ORF2 ammo 
acid sequence, and selecting a transformed host cell. The method may fiirthar 
comprise growing said transformedhost cell in amedium and purifying an amino 
add produced by said transformed host cell 

The tenn " 'lysA " refers to a truncated lysA gene or amino add sequence 
used by Applicants and described infra. The term ''lysA'' refers to the full length 
lysA gene or amino add sequence used by Applicants and described infra. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising a nucleic add molecule encoding the Corynebacterium 
glutamicum lysine pathway ask amino add sequence of SEQ ID N0:2; and at 
least one additional Corynebacterium spedes lysine pathway grate selected from 
the group consisting of a nucldc add molecule encoding tiie asd polypeptide, a 
nucleic add molecule encoding the dapA polypeptide, a nucleic acid molecule 
encoding the dapB polypeptide, a nucleic add molecule encoding the ddh 
polypeptide, a nucleic acid molecule encoding the 'lysA polypeptide, a nucleic 
acid molecule encoding the lysA polypeptide and a nucleic add molecule 
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encoding ORF2 polypq)tide. In a prcferred embodiment of the invention, the 
isolated polynucleotide molecule conqirises pK184-KDABH'L. In another 
prefored embodiment of the invention, the isolated nucleic acid molecule 
comprises pKl 84-KDAB. In another preferred embodiment of the invention, the 
5 isolated nucleic add molecule comprises pI>2*KDABHL. In another prefmed 

embodiment of&einventiQn,tfaeisolatednucleicacidmoleculecomprisespDll- 
KDABITL. 

It is anotfier object of the invention to provide a host cell transformed 
with an isolated polynucleotide molecule comprising a nucleotide sequence 

1 0 encoding an isolatedpolypeptide comprising the amino acid sequence of SEQ ID 

N0:2, who^ the isolated nucleic acid molecule is integrated into the host cell's 
chrcnnosome thereby increasing the total number of amino add biosyntfaetic 
pathway genes in the host cell chromosome. In one embodiment the 
polynucleotide furtfaercomprisesatleastoaeadditional Corynd?acteriumspecies 

15 lysine pa&way gene selected finom the group consisting o£ a nucldc add 

molecule encoding an asd polypeptide; a nucleic add molecule encoding a dapA 
polypeptide; anucleic addmolecule encoding a dapB polypeptide; anucldc add 
molecule encoding a ddh polypeptide; a nucleic add molecule encoding a 'lysA 
polypeptide; a nucldc acid molecule encoding a lysA polypeptide; and a nucleic 

20 add molecule encoding an ORF2 polypeptide. 

In another embodiment, the polynucleotide further comprises a nucleic 
add molecule encoding a polypeptide wherein said asd polypeptide is SBQ ID 
N0:4; said dapA polypq)tide is SEQ ID NO:6; said dapB polypeptide is S£Q ID 
N0:8; said ddh polypeptide is SEQ ID NO: 10; said 'fysA polypeptide is SEQ ID 

25 NO: 21; said lysA polypeptide is SEQ ID NO: 14; andsaid ORF2 polypeptide is 

SEQ ID NO: 16. 

In another embodiment, the polynucleotide further comprises a nucleic 
add molecule wherdn said asd polypeptide is SEQ ID N0:4; said dapA 
polypeptide is SEQ IDN0:6; said dapB polypeptide is SEQ ID N0:8; said ddh 
30 polypeptide is SEQ ID NO: 10; said 'lysA polypeptide is SEQ ID N0:21; said 
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lysA polypeptide is SEQ ID NO: 14; and said ORF2 polypeptide is SBQ ID . 
NO: 16. 

In another embodiment, the polynucleotide fyrther comprises a nucleic 
acid moleculeencoding the as</ amino acid sequence of SEQ IDN0:4; anucldc 
acid molecule encoding ti»e dapA amino acid sequence of SBQ ID N0:6; a 
nucleic add molecule oicodingthe dapB amino add sequence of SBQ ID NO: 8; 
and a nucldc add molecule encoding 4e ORF2 amino add sequence of SEQ ID 
NO: 16. 

In anolher embodiment, the polynucleotide further comprises a nucldc 
add molecule encoding die asd amino add sequence of SEQ IDN0:4; anucldc 
add molecule encoding the dapA amino add sequaice of SEQ ID N0:6; a 
nucldc addmolecule encoding the dapB amino addsequence of SEQ ID NO:8; 
a nuddc add molecule encoding the ddh amino add sequence of SEQ ID 
NO:10; and a nucldc add molecule encoding the ORF2 amino addsequence of 

SEQ ID NO: 16. 

Jn anotfier embodiment, the polynucleotide fuitiier con[q)rises a nucldc 
addmolecule mcoding the osi annuno add sequence of SEQ IDN0:4i anucldc 
add molecule oicoding die dapA amino acid sequmce of SEQ ID N0:6; a 
nucldc addmolecule encoding die dapB amino add sequence of SEQ IDN0:8, 
a nucldc add molecule encoding the ddh amino add sequence of SEQ ID 
N0:10; anucldcaddmoleculeencodingthe Vjts/i amino add sequence of SEQ 
IDNO: 21; and anucldc addmolecule encoding the ORF2 amino addsequence 

of SEQ ID NO: 16. 

In another anbodiment, flie polynucleotide further cranprises a nucldc 
add molecule encoding the asd amino add sequence of SEQ ID N0:4; anucldc 
acid molecule encoding the dapA amino acid sequence of SEQ ID N0:6; a 
nucleic addmolecule encoding the dapB amino acid sequence of SEQIDNO:8; 
a nucldc acid molecule encoding the ddh amino add sequence of SEQ ID 
NO:10; a nucleic add molecule encoding the amino add sequence of SEQ 
ID NO: 1 4; and a nucleic acid molecule encoding the ORF2 amino add sequence 
ofSEQIDNO:16. 
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In one onbodiment, the traosformedhost cell is a Brevibacterium selected 
from the group consisting of Brevibacterium flavum NRRL-B30218, 
Brevibacterium flavum NRRL-B30219, Brevibacterium lactofermentum 
miJSd^B30220,Brevibacteriumlactofermentu^ Brevibacterium 

5 lactofermentum NRRL-B30222, Brevibacterium flavum NRRL-30234 and 

Brevibacterium lactofermentumlfRB1^30235. Inanotfaereinbodiment^thehost 
cell is Escherichia coli DH5 a MCR NRRL-B30228. In another anbodimcnt, 
the host cell is a C glutamicum selected from the group consisting of C 
glutamicum NRRL-B30236 and C glutamicum NRRL-B30237. 

10 It is auother object of the invention to provide a method of producing 

lysine comprising cultuiing the host ceUs coaq)rising the amino acid sequence of 
SEQ ID NO: 2 wherein said host cells comprise one or more of (a) increased 
enzyme activity of one or more lysine biosynthetic pathway enemies conq)ared 
to the genetically unalt^ed ncmhuman host cell; (b) one or more copies of each 

1 5 gene encoding a lysine biosynthetic pathway enzyme; and, (c) alteration of one 

or more transcription factors regulating transcription of one or more genes 
encoding a lysine biosynthetic pathway enzyme, wherein said host cell produces 
lysine in said culture medium. In one embodiment of the invention, the increased 
enzyme activity conqsrises overexpipssing one or more genes encoding one or 

20 more lysine biosynthetic pathway emymes. In another embodiment of die 

invention flie increased enzyme activity results from the activity of one or more 
modified lysine biosynthetic pathway enzymes wharein said exayme 

m 

modification results in a change in kinetic parameters, allosteric regulation, or 
both, compared to the enzyme lacking the modification. In another onbodiment 
25 of the invention, alteration of one or more transcription factors comprises one or 

more mutations in transcription inhibitor proteins, one or more mutations in 
transcription activator proteins, or both, wherein said one or more mutations 
increases transcription of the target nucleotide sequence compared to the 
transcription by said one or more transcription factors lacking said alteration(s). 
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It is an object of the invention to provide an isolatedpolypeptide, wherein 
said polypeptide comprises an amino acidsequence having at least 95% sequence 
identity to the amino acid sequence of SEQ ID NO: 1 9. It is a further object of the 
invention to provide an isolated polypeptide con^rising the amino add sequence 
5 of SEQ ID NO: 19. It is a furdier object of the invention to provide an isolated 

polynucleotide comprising anucleicaddhaving the sequence of SEQ IDN0:18. 
It is another object of the invention to provide host cell NRRL B30360. 

It is an object ofthe invention to provide an isolatedpolypeptidewherein 
said polypqptide comprises a polypeptide having at least 95% sequence identity 
10 to the amino acid sequence of SEQ ID N0:21. It is a fiirtha- object of the 

invention to provide an isolated polypeptide comprising the amino acid sequence 
of SEQ ID NO:21. It is a further object of the invention to provide a 
polynucleotide molecule comprising a nucleic acid having the sequence of SEQ 
IDN0:2a 

15 It is an object of the invention to provide an isolated polynucleotide 

molecule comprising anucleotide sequence ^coding thepolyp^ptide conqmsing 
the ammo acid sequence of SEQ ID NO:2, further comprising a promoter 
sequence where said promote sequence has at least 95% sequence identity to 
SEQ ID NO: 17. It is a further object of the invention to provide an isolated 

20 polynucleotide molecule comprising a nucleotide sequence encoding the 

polypeptide comprising the amino acid sequence of SEQ ID NO: 2, wherein the 
polynucleotide molecule further comprises the sequence of SEQ ID NO: 17. It 
is a further object of the invention to provide a host cell NRRL B30359. 

Further objects and advantages of thepresentinvention will be clear from 

25 the description that follows. 
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Brief Description of the Figures 

Figure 1. A schematic of the L-lysine biosyndietic paHiwajr in 
Corynebacterium glutamicum (Sahm et aL). 

Figure 2. The nucleotide seqaence of ii;(ATX:C 2 1 529 sequence) (SEQ 
IDNO:l). 

Figwv 3 A, B. The amino acid sequence of ask (ATCC 2 1 529 sequaice) 
(SEQ ID N0:2). 

Figure 4. The nucleotidesequaice of asd (ATCC 2 1529sequence) (SEQ 
IDN0:3). 

Figure 5 A, B. The amino acid sequence of asd (ATCC 2 1 529 sequence) 
(SEQ ID N0:4). 

Figure 6. The nucleotide sequence of dapA (bfKRLr-Bll474) (SEQ ID 

NO:5 ). 

Figure 7. The amino acid sequence of dapA (NRRL-Bl 1474) (SEQ ID 

NO:6). 

Figure 8. The nucleotide sequence of dapB (NRRL-Bl 1474) (SEQ ID 

NO:7). 

Figure 9. The amino acid sequence of dapB (NRRL-Bl 1474) (SEQ ID 

NO:8). 

Figure 10. The nucleotide sequence of ddh (NRRLr-Bl 1474) (SEQ ID 

N0:9). 

Figure 11 A, B. The amino acid sequaice of d<Ui (NRRL-Bl 1474) (SEQ 
ID NO: 10). 

« 

Figure 12, The nucleotide sequence of full length lysA (NRRL-Bl 1474) 
(SEQ ID NO: 1 1) used to obtain the truncated lysA {HysA) nucleotide sequence. 
Underlined region annealed with lysA primer. 

Figure 13. The amino acid sequence of full length lysA (NRRL- 
Bl 1474) (SEQ ID NO. 12) comprising the truncated lysA ( HysA) amino acid 
sequence (SEQ ID NO: 21). Underlined L: the last amino acid residue of lysA 
encoded in the truncated PGR product. 
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Figure 14. The nucleotide sequence of full length lysA (pRS6) (SEQ ID 
NO; 13), 

Figure 15 A, B, C The amino add sequence of fiill length lysA (pRS6) 
(SEQ ID NO: 14). 

Figure 16. The nucleotide sequoice of 0RF2 (NRRL-Bl 1474) (SEQ 
ID NO: 15). 

Figure 17. The amino add sequence ofORF2(NRRL-Bl 1474) (SEQ 
ID NO: 16). 

Figure 18. A schematic depictton of the coastruction of the 5 and 6 
lysine pathway gene constructs of the invention. 

Figure 19. Compsxisoa of die aspartokinase (ask) amino add sequence 
fiom ATCC13032, N13 and ATCX:21529. , 

Figure 20. The nucleotide sequence of tiie Hpal-PvuII fiagment fcam 
pRS6 (SEQ ID NO: 17) conqnising the PI promoter. 

Figure 21 A, B. A schematic depiction of the construction of the 
pDE!ia2-KDABHPlL construct 

Figure 22, A sdiematic depiction of the construction of the pDElia2fcs- 
KDBHL constinct 

Figure 23. The nucleotide sequeaice of truncated 0RF2 (SEQ ID NO: 1 8). 

Figure 24. The amino add sequence of truncated 0RF2 (SEQ ID 
NO: 19). 

Figure 25. The nucleotide sequence of truncated LysA ('lysA)(NRRL- 
B 1 1474) (SEQ ID NO:20). 

Figure 26. The amino add sequence of truncated LysA CLysA)(NRRL- 
B11474)(SEQIDNO:21). 
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Detailed Description of the Preferred Embodiments 

A. Definitions 

In ordar to provide a clear and consistent understanding of the 
specification and claims, including the scope to be given such terms, the 
following definitions are provided. It is also to be noted that the term "a" or "an*' 
entity, refers to one or more of that entity; for example, "a polynucleotide,** is 
understood to represent one or more polynucleotides. 

AUosteric Regulatfon. As used herein, tiie term refers to regulation of 
CT^rme activity through the binding of one or more ligands (allosteri c effectors) 
to one or more binding sites. The ligands may be the same moleculeor difiFarmt 
molecules. The molecules bind to sites on the razyme other than the enzyme 
activesite. As a result ofthe binding, a coiiformatioiial change is induced in Ae 
enzyme whidi regulates affinity of the active site for its substrate or oth^ 
ligands. AUosteric effectors may serve to enhance catalyticsitesubstrate affinity 
(allosteric activators) or to reduce afiSnity (allosteric repressors). Allosteric 
effectors form the basis of metabolic control mechanisms such as feedback loops, 
for example (See, Copeland, Robert A., in Enzymes. A Practical Introduction to 
Structure, Mechanism, and Data Analysis, pages 279-296, Wiley-VCH, New 
Yoik (1996)). 

Amino Acid Biosyntfaetic Pathway Genes. As used herein, the term 
"amino acid biosynthetic pathway gene(s)" is meant to include those genes and 
genes firagments encoding peptides, polypeptides, proteins, and enzymes, which 
are directly involved in the synthesis of amino acids. These genes may be 
identical to those which naturally occur within a host cell and are involved in the 
synthesis of any amino acid, and particularly lysine, within that host cell. 
Alternatively, there may be modifications or mutations of such genes, for 
example, the genes may contain modifications or mutations which do not 
significantly affect the biological activity of tihie encoded protein. Forexample, 
the natural gene may be modified by mutagenesis or by introducing or 
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substituting one or more nucleotides or by rmioving nonessential regions of the 
gene. Such modifications arc readily perfomied by standard techniques. 

Auzotroph. As used herein, the torn refers to astrain of miaxK>rganism 
requiring for growth an extmial source of a specific metabolite that cannot be 
synthesized because of an acquired genetic defect 

Amino Add Supplement As used herein, tibie term refers to an amino 
add required for growth and added to minimal media to support auxotroph 
growth. 

Chromosomal Integration. As used herein, the tena refers to the 
insertion of an oogenous DNA fragment into the chromosome of a host 
organism; mcxte particularly, the term is used to refer to homologous 
recombination between an exogenous DNA fragment and the appropriate region 
of the host cell chromosome. 

Enliancers. As usedherein, die term refers toaDNAsequence which can 
stimulatepromoteractivityandmay be an endogenous element or aheterologous 
element inserted to enhance the level, i.e., strength of a promoter. 

High Yield Derivatiye. As used herein, the temi refers to strain of 
microorganism that produces a higher yield from deictrose of a specific amino 
acid when compared with die parental strain from which it is derived. 

Host CelL As used herein, the term "host cell" is intended to be 
interchangeable with the term "microorganisnL" Where a difference is intended, 
the difference will be made clear. 

Isolated Nndeic Acid Molecule. As usedherein, the term is intended to 
mean a nucleic acid molecule, DNA or RNA, which has been removed from its 
native environment For example, recombinant DNA molecules contained in a 
vector are considered isolated for the puiposesofttie present invention. Further 
examples of isolated DNA molecides include recombinant DNA molecules 
maintained in heterologous host cells orpurified(paitially or substantially) DNA 
molecules in solution. Isolated RNA molecules include in vivo or in vitro RNA 
transcripts of the DNA molecules of the present invention. Isolated nucleic add 
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molecules according to the present invention further include such molecules 
produced synthetically. 

Lysine BiosyuflieticPatliway Protein. As usedherein, the term lysine 
biosynthetic pathway protein*' is meant to include those peptides, polypeptides, 
5 proteins, and enzymes, which are directly involved in the synthesis oflysine finom 

aspartate. Also included are amino add sequences as encoded by open reading 
frames (ORF), where the ORF is associated with a lysine biosynthetic pathway 
operon. These proteins may be identical to those whidi naturally occur within 
a host cell and are involved in the synthesis of lysine within diat host cell. 

10 Alternatively, there may be modifications or mutations of sudi proteins, for 

example, the proteins may contain modifications or mutations which do not 
significantly affect the biological activity of the protein. Forexanq]le,thenatural 
protein may be modified by mutagenesis c^by introducing or substituting one or 
more amino acids, preferably by conservative amino acid substitution^ or by 

IS removing nonessential regi(»is of the protdn. Such modifications are readily 

performed by standard techniques. Aitonatively, lysine biosynthetic proteins 
may be heterologous to tiie particular host cell. Sudi proteins may be from any 
organism having genes encoding proteins having the same, or similar, 
biosynthetic roles. 

20 Mutagenesis* As used herein, the term refers to a process whereby a 

mutation is generated in DNA. With "random" mutagenesis, the exact site of 

■ 

mutation is not predictable, occurring anywhere in the genome of the 
microorganism, and the mutation is brought about as a result of physical damage 
caused by agents such as radiation or chemical treatment rDNA mutagaiesis is 
25 directed to a cloned DNA of interest, and it may be random or site-directed 

Mutatioa. As used herein, the term ref^ to a one or more base pair 
change, insertion or deletion, or a combination thereof, in die nucleotide 
sequence of interest 

Operably Linked. As used herein, the term "operably linked" refers to 
30 a linkage of polynucleotide elements in a functional relationship. A nucleic add 

is "operably linked" when it is placed into a functional relationship widi anotha* 
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nucleic add sequoice. For instance, a inomoto- ot enhance is opmbly linked 
to a coding sequence if it affects the transcription of the coding sequence. 
Operably linked means that the DNA sequences being linked are typicaUy 
contiguous and, where necessary, join two protein coding regions, contiguous 
and in reading fiame. However, since enhancers generally function when 
sq)arated from the promoter by several kUobases and intronic sequences may be 
of variable lengths, some polynucleotide elements may be operably linked but 
not coitiguous. 

Operon. As used ho-ein, Ae term refers to a contiguous portion of a 
transcriptional complex in which two or more open reading fi^es encoding 
polypeptides are tianscribed as a multi-cistionic messenger RNA, controlled by 
a ds-acting promota- and other cis-acting sequences necessary for efiSdent 
ti:anscription, as well as additional ds acting sequences important for effident 
transcription and translation (eg., mRNA stability controlling regions and 
tiranscription termination regions). The term generally also refds to a unit of 
gene expression and regulation, including the stiiictural genes and regulatory 
donoits in DNA. 

Parattal Strain. As used herein, the term refers to a strain of host cell 
subjected to some form of treatment to yield the host ceU of the invention. 

Percent Yield From Dextrose. As used herein, ihe term refers to the 
yield of amino add from dextrose defined by die formula [(g amino add 
produced/ g dextrose consumed)* 100] = % Yield 

Phenotype. As used herein, tiie term refers to observable physical 
characteristics dependait upon the genetic constituticm of a host cell. 

Promoter. As used herein, the term "promoter'' has its art-recognized 
meaning, denoting a portion of a gene containing DNA sequences that provide 
for tire binding of RNA polymerase and imtiation of transcription and thus refens 
to a DNA sequence capable of controlling the expression of a coding sequence 
or functional RNA. Promoter sequences are commonly, but not always, found in 
the 5' non-coding regions of genes. In general, a coding sequence is located 3' to 
a promoter sequence. Sequence elements within promoters that function in the 
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initiadoii of tnaiscription are often characterized by consensus nucleotide 
sequences. The inx>moter sequmce consists of proximal and more distal 
upstream elements (chancers). As used herein, die term "^endogenous promoter" 
refers to a promoter sequence which is a naturally occurring promoter sequence 
in that host microoiganism. The term •'heterologous promoter" refers to a 
promoter sequence whidi is a non-naturally occurring promoter sequence in diat 
hostmicrooiganisnL Hie heterologous occurringpromotersequencemay befrom 
any prokaiyotic or eukaryotic organism. A synthetic promoter is a nucleotide 
sequence, having promoter activity, and not found naturally occurring in nature. 

Promoters may be derived in their entirety from a native gene, or be 
hybrid promoters. Hybrid promoters are composed of different elmients d^ved 
from different promoters found in nature, or even comprise synthetic DNA 
segments. Hybrid promoters may be constitutive, inducible or mviroomentally 
responsive. 

Usefid promoters include constitutive and inducible promoters. Many 
such promoter sequences are known in the art. See, for example, U.S. Pat Nos. 
4,980,285; 5,631,150; 5,707,828; 5,759,828; 5,888,783; 5,919,670, and, 
SambrooK et aL, Molecular Cloning: A Laboratory Manual, 2nd Ed, Cold 
Spring Harbor Press (1989). Othor useful promoters include promoters which are 
neither constitutive nor responsive to a specific (or known) inducer molo^ule. 
Sudi promoters may include diose that respond to developmental cues (such as 
growth phase of the culture), or environmental cues (such as pH, osmoticum, 
heat, or cell density, for example). 

Examples of environmmtal conditions that may effect transcription by 
inducible promoters include anaerobic conditions, elevated temperature, or the 
presence of light It is understood by those skilled in the art that different 
promoters may direct the expression of a gene in different cell types, or in 
response to different environmental conditions. Promoters which cause agene to 
be expressed in most cell types at most times are commonly referred to as 
"constitutive promoters." It is further recognized that since in most cases the 
exact boundaries of regulatory sequences have not been completely defined. 
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DNA fragmeDts of differmt lengths may have identical or similar promoter 
activity. 

Relative Growth. As used hoein^ tiie term refers to a measurement 
providing an assessment of growth by directly comparing growtli of a parental 
strain with that of a progeny strain over a defined time period and wift a defined 
medium. 

Transcription factor. As used herein^ tfie terai "transaiption factor" 
refers to RNA polymerases, and other proteins that interact with DNA in a 
sequ«ce*specific manner and exert transcriptional regulatory eflfects. 
Transcriptional Actors may be ^anscription inhibitoiy proteins or transcription 
activator proteins. In the context of the present invention, binding sites for 
transcription factors (or transcription complexes) are often included in the 
transcriptional regulatory element(s). 

Transcription factor recognition site. As used herein, a ''transcription 
factor recognition site" and a "transcription factor binding site" refer to a 
polynucleotide sequence(s) or sequence motif(s) which are identified as being 
sites for the sequence-specific interaction of one or more transcription factors, 
firequenfly taking tiie form of direct protein-DNA binding. Typically, 
transcription factor binding sites can be identified by DNA footprinting, gel 
mobility shift assays, and the Hke, and/or can be predicted on the basis of known 
consensus sequence motifs, or by other methods known to those of skill in die 
art 

Transcriptional Complex. As used herein, the term "transcriptional 
unit" or "transcriptional complex" refia:5 to a polynucleotide sequence that 
comprises a structural gene (one or more exons), a ds-acting linked promoter and 
one or more other cis-acting sequoices necessaiy for efficient transcription of the 
stmctural sequences, distal regulatory elements necessary for appropriate 
transcription of the structural sequences, and additional cis sequences important 
for efiRcient transcription and translation (e.g., polyadenylation site, mRNA 
stability controlling sequences). See, for example U.S. Patent No. 6,057,299. 
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Transcriptional Regnlatoiy Element As used herein, Ae tenn 
"transcriptional regulatory element" refers to a DNA sequence which activates 
transcription alone ot in combination wifli one or mare other DNA sequences. A 
transcriptional regulatory element can, Tot exan^le, con^e a pramotdr, 
response element, negative regulatory element, sflencer element, gaie 
si^ressor,and/OTaihancer. See, for ecan^le, U.S. Patent No. 6,057,299. 

B. Microbiological and Recombinattt DNA Methodologies 

The invention as provided herein utilizes some methods and teduuques • 
that are known to those skilled in the aits of micn^ology and recombinant DNA 
tedmologies. Methods and techniques for Ae growth of bactoial cells, the 
introducttoi of isolated DNA molecules mto host cells, andtiie isolation, cloning 
and sequencing of isolated nucleic acid molecules, etc., are a few exanq)Ies of 
such me&ods and techniques. These methods and teduuques are desaibed in 
many standard laboratory manuals, such as Davis et al.. Basic Methods In 
Molecular Biology (m6),lK NfiUer, Experiments in Molecular Genetics, Cold 
Spring Harijor Laboratory Press, Cold Spring Harbor, New York (1972); J.H. 
Miller, A Short Course in Bacterial Genetics, Cold Spring Haibor Laboratory 
Press, Cold Spring Harbor, New York (1992); M. Singer and P. Berg, Genes & 
Genomes, University Science Books, Mill Valley, California (1991); J. 
Sambrook, RF. Fritsch and T. Maniatis, Molecular Cloning: A Laboratory 
Manual, 2d ed. , Cold Spring Haibor Laboratory Press, Cold Spring Harbor, New 
Yoik(1989); P.B. Kaufinan etal.. Handbook ofMolecular and Cellular Methods 
in Biology and Medicine, CRC Press, Boca Raton, Florida (1995); Methods in 
Plant Molecular Biology and Biotechnology, B.R. Glick and J.E. Thompson, 
eds., CRC Press, Boca Raton, Florida (1 993); and PJF. Smith-Keaiy, Molecular 
Genetics of Escherichia coli. The Guilford Press, New Yoric, NY (1989), all of 
which are incoiporated herein by reference in their entireties. 

Unless otherwise indicated, all nucleotide sequences newly described 
herein were deteimined using an automated DNA sequencer (such as the Model 
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373 fiom AppUed Biosystems, Inc.). Therefore, as is known in the art, for any 
DNAsequencedetenninedby1hisautoinatedapproach,anynucl^ 
detennined herein may contain some enors. Nudeotide sequences detenmned 
by automation are typically at least about 90% identical, more typicaUy at least 
about 95% to at least about 99.9% identical to the actual nudeotide sequence of 
the sequenced DNA molecule. Hie actual sequence can be more precisely 
determined by other approaches including manual DNA sequencing methods 
wdl known in the art 

m 

hk certain embodiments, polynucleotides of the invention comprise a 
nucleic add, die sequence of whidi is at least 90%, 91%, 92%, 93%, 94%, 95% 
96%, 97%, 98% or 99% identical to a sequence selected from the group 
consisting of SEQ ID NO: 17, SEQ ID NO: 18; and SEQ ID NO:20, or a 
complementaiy sequence tfaoeof 

By a polynucleotide comprising a nucleic add, the sequence of which is 
at least, for exanqjle, 95% "identical" to a reference nucleotide sequence is 
intended diat the nucldc add sequence is identical to the reference sequence 
except that the nucldc add sequence may include up to five mismatches per 
each 100 nucleotides of die reference nucleic add sequence. In other woids, to 
obtain a nucldc acid, the sequence of which is at least 95% identical to a 
reference nucldc add sequence, up to 5% of the nucleotides in the reference 
sequence may be deleted or substituted with another nucleotide, or a number of 
nucleotides up to 5% of the total nucleotides in the reference sequence may be 
insertedintothereferencesequence. The reference (query) sequence may be any 
one of the entire nucleotide sequences shown in SEQ ID NO: 17, SEQ ID NO: 1 8, 
or SEQ ID NO:20, or any fragment of any of these sequences, as described infra. 

As a practical matt», wheAer any particular nucleic add sequence is at 
least 90%, 91 %. 92%, 93%, 94%, 95%, 96%. 97%, 98% or 99% identical to, for 
instance, a nucleotide sequence consisting of SEQ ID NO: 17; SEQ ID N0.:18, 
or SEQ ID NO:20, or a complementaiy sequence thereof, can be deteraiined 
conventionally using sequence analysis computer programs such as a OMIGA® 
Version 2.0 for Windows, availableftom OxfordMoleoular, Ltd. (Oxford, U.BC). 
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OMIGAusestheCLUSTALWalignmentalgorithmusingthesIowMI<^ 
prpgramming alignmeot method ivith default parameters of an open gap penally 
of 10 and an extend gap penally of 5.0, to find the best alignment between two 
nucleotide sequences. When using CLUSTAL W or any other sequence 
aKgnment program to detemiine Aether a particular sequence is, for instance, 
95% identical to a reference sequence according to the present invention, the 
parameters are set, of course, such that the percentage of identity is calculated 
over the full length of the reference nucleotide sequence such that gqjs, 
mismatches, or insertions of up to 5% of the total number of nucleotides in the 
referencesequence are allowed Other sequence analysis programs, knowninlhe 
ait, can be used in die practice of the invention. 

This embodiment of die present invaition is directed to polynucleotides 
comprising a nucleic add, the sequence of which is at least 90%, 91%, 92%, 
93%, 94%, 95%, 96%, 97%, 98% or 99% identical to anucleic add sequenceof 
SEQ ID NO: 17, SEQ ID NO: 18, and SEQ ID NO:20, or a complementaiy 
sequence thereof irrespective of wheflier they have functional activity. This is 
because even where aparticularpolynucleotide does not have functional activity, 
one of skill in the art wouldstiU know how to use thenuddc addmolccule, for 
instance, as a hybridization probe, an SI nuclease mapping probe, or a 
polymerase chain reaction (PGR) prima-. 

Preferred, however, are polynucleotides comprising a nucldc add, the 
sequence of which is atleast90%, 91%, 92%, 93%, 94%,95%, 96%, 97%, 98% 
or 99% identical to a nucldc add sequence of SEQ ID NO: 17, SEQ ID NO: 18 
or SEQ ID NO:20, or a complementary sequence thereof, which do, in feet, have 
functional activity in Corynebacterium spedes. 

By a polypeptide having an amino add sequence at least, for example, 
95% "identical" to areference amino add sequence of a polypeptide is intended 
that the amino add sequence of die claimed polypeptide is identical to the 

referencesequence exceptfhatdieclaimedpolypeptide sequence include up 
to five amino add alterations per eadi 100 amino acids of die reference amino 

addofdiepolypeptide.lnotiierwords.toobtainapolypeptidehavinganamino 
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acid sequence at least 95% identical to a reference amino acid sequence, up to 
5% of fte amino acid residues in the reference sequence may be deleted or 
substituted wifli another amino acid, or a number of amino adds up to 5% of the ^ 
total amino add residues in the refe«:nce sequence may be. inserted mto the 
teferencesequence. ihesealterationsoftherefaencesequencem^occuratlhe 
amino or carboxy terminal positions of the reference amino add sequence or 
anywhere between those temrinal positions, interspersed dAer individually 
among residue, in Ae reference sequence or in one or more contiguous groups 

within Ae refiaence sequence. 

Asapracticalmatter.whetheranyparticularpolypeptideisatleast80%, 

85% 90«/o. 91%, 95%, 96%, 97%, 98% or 99% identical to. for instance, the 
amino add sequence shown in SEQ ID N0:2 or to the amino add sequence 
encoded by a nucldc add sequence can be detemiined conventionaUy usmg 
known computer programs sudi the Bestfit program (Wisconsin Sequence 
Analysis Padcage. Version 8 for Umx. Genetics Computer Group. University 
ResearchPark.575SdenceDrive.Madison.WI53711). When using Bestfit or 
any other sequence aUgnment program to determine whether a particular 
sequence is, for instance, 95% identical to areference sequence according to the 
present invention, the parameters are s et, of course, sudi that the p«x:entage of 
identity is calculated over the full lengfli of the reference amino add sequence 
and that gaps in homology of up to 5% of the total number of amino acid 
residues in tiie reference sequence are aUowed. 

m a spedfic embodiment, the identity between a reference sequence 
(query sequence, a sequence of the present invention) and a subject sequence, 
alsoreferredtoasaglobalsequenceaUgnment,isdetemunedusingtheFASTl>B 

computer programbased onthe algorithm of Bruflag et al (Comp. App. Biosa. 
6 237-245 (1990)). Pref^ parameters used in a FASTDB amino add 
alignment are: Matrix=PAM 0, k-tupl^. Mismatch Penalty=l. Joining 
Penalty=20. Randomization Group Length=0, Cutoff Scor^l. Window 
Siz^equence length. Gap Penalty=5. Gap Size Penalty=0.05. Window 
Size=500 or thelengthofthesubject amino addscquence.whidieverisshorter. 
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According to this embodiment, if the subject sequence is shorter than the query 
sequence due to N- or C-terminal deletions, not because of internal deletions, a 
manual collection is made to the results to take into consideration the feet that 

theFASTDBi«ogram does not account for N-andC-terannal truncations of the 

subjectsequencewhencalculatingglobalpeicentidentity. Forsubjectsequences 

truncated at the N- and C-tennini, relative to the query sequence, the percent 
identity is correctedbycalculatingthenumberofresiduesofthequery sequence 

that are N- and C-terminal of the subject sequence, which are not 
matdied/aligned with a corresponding subject residue, as a percent of the total 
bases of the query sequence. A detemunation of whether a residue is 
matdied/aUgncd is determined by results of the FASTDB sequence alignment 
This percentage is then sublracted from the percent identity, calculated by the 
above FASTDB program specified parameters, to arrive at a final 

percent identity score. This final percent identity score is what is used for the 
purposes of this embodiment Only residues to the N- and C-termini of the 
subject sequence, whidi are not matched/aligned with the query sequence, are 
considered for the purposes of manually a^usting the percent identity score. 
That is, only query residue positions outside the ferthest N- and C-teraiinal 
residues ofthe subject sequence. For example, a 90 amino acid residue subject 
sequence is aligaed with a 100 residue query sequence to determine percent 
identity. Hie deletion occurs at the N-terminus of the subject sequence and 
therefore, the FASTDB alignment does not show a matching/alignment ofthe 
first 10 residues at the N-terminus. Hie 10 unpaired residues rqjresent 10% of 
the sequence (number of residues at llie N- and C-termini not matched/total 
number ofresidues in thequerysequence)solO% is subtractedfiomAepercent 

identity scorecalculatedbytheFASTDBprograra Ifthe remaining 90 residues 
were perfectly matched the final percent identity would be 90%. In another 
example, a 90 residue subject sequence is compared wift a 100 residue query 
sequence. This time the deletions are internal deletions so there are no residues 
at the N- or C-termini of the subject sequence which are not matehed/ahgned 
with the query. In this case the percent identity calculated by FASTDB is not 
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manually carrecteA Once again, only residue positions outside the N- and C- 
tennmal ends of the subject sequence, as displayed in the FASTDB alignment, 
which are not matched/aligned with the query sequence are manually corrected 
for. No oflier manual corrections are made for the purposes of tibis embodiment 

C Methods and Processes of the Invention 

Various ^bodiments of the invention provide methods to increase the 
production of an amino acid and processes for the production of an amino add 
fixMn a Corynebacterium species host cell. Particularly prefened 
Corynebacterium species of the methods and processes of the invention include: 
Corynebacterium glutamicum, Brevibacterium flavum^ Bre\nbacterium 
lactofermentum and otha* Comytiebacteria mdBrevibacteria species known in 
the art 

As will be understood by those skilled in the art, the term 
""Corynebacterium species" includes those organisms previously identifiedin the 
literature as ''Brevibacterium species," for example Brevibacterium flavum and 
Brevibacterium lactofermentum which have now been reclassified into the genus 
Corynebacterium (Int. J. Syst Bacterial 41: 255 (1981)). 

Amino acid biosynthetic pathway genes embodied by the mediods and 
processes describedherein include those for L^lycine, L-alanine, L-methionine, 
L-phenylalanine, L-tryptophan, L-proline, L-serine, I^threonine, L-cysteine, 
L-tyrosine, L-asparagine, L-glutamine, L-aspartic acid, L-glutamic add, L- 
lysine, L-arginine, L-histidine, L-isoleucine, L4eudne, and L-valine 
biosynthesis. Particularly preferred embodiments are drawn to biosynthetic 
pathway genes for L-lysine (Sahm et aLj Ann, N. Y. Acad. Sci. 782: 25-39 
(1996)), L-threonine, L-isoleudne, L-tiyptophan, and L-valine. 

By way of example, the amino acid pathway for L-lysine biosynthesis 
is well known to skilled artisans of amino acid production in Corynebacterium 
spedes. Genes encoding the enzymes important for the conversion of L- 
aspartate to L-lysine include the ask, asd, dapA, dapB^ ddh and lysA genes 
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(Figure 1). Hius, the invention provides herein for exemplaiy puiposcs only, 
specific embodiments utilizing L-lysine biosynthetic pathway genes. Other 
embodiments drawn to the use of biosynthetic pathway genes for the syndesis 
of other amino acids are also encompassed by the invention described herein. 

The methods to increase the production of an amino add and the 
process es for the production of an amino acid of the invention both utilize a step 
requiring the transformation of an isolated nucleic add molecule into a 
Corynebacterium spedes host cell. As known to one skilled in the art, 
transformation of an isolated nucldc add molecule into a host cell may be 
effected by electrcqwiation, transduction or other methods. These mettiods are 
described in the many standard laboratory manuals referenced and incoiporated 
herein. 

The mediods to increase tiie production of an amino add and the 
processes for the production of an amino add of the invention both utilize a step 
requiring an^Uficationofatleast one amino add biosynthesis pafhw^ir gene. As 
known to one skilled in the art, the term ampUfication means increasing the 
number of a gene or genes of an amino add biosynthetic pathway by any means 
known in die art Particularly preferred means of an^lification include. ( 1) Ihe 
addition an isolated nucldc add molecule comprising copies of a gene or genes 
of a biosyndietic pathway by insertion into the dWomosome of a host cell, for 
exan^le by homologous recombination, and (2) the addition an isolated nucldc 
add molecule comprising copies of a gene or genes of a biosynthetic pathway 
into a host cell by wsqt of a self-replicating, extrarchromosomal vector, for 

example, a plasmid. 

Ano&er method of the invention to inoease the production of an amino 
add comprises increasing the expression of at least (me amino add biosynthetic 
pathway gaie. Preferred methods of inaeasing exjMression comprise using 
heterologous promoters, regulated promoters, unr^ated promotas and 

combinations thereof. 

Methods of inserting an isolated nucldc add molecule into the 
chromosome of a host cell are known to those skilled in the art. For example. 
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insertion of isolated nucleic acid molecules into the chromosome of 
Corynebactenim species may be done utilizing thepK184 plasmid described by 
Jobling, M. el aUNuddc Adds Research 18(17): 5315-5316 (submitted 1990). 
Becausfs these vectois lack a Corynebacterium species origin of rcpUcation and 
5 contain a selectable maiker such as kanamycin {kan), cells will only be capable 

of growing under selection if the vector has been inserted into Ae host ceU 
cbromosame by homologous recombination. 

In alternative embodiments, the invention also provides mefliods for 
increasing amino acid production and processes for the production of an amino 
10 add wherein biosynthetic pathway gene an^lification is accomplished through 

the introduction into a host ceU of a self-repUcating, extra-chromosomal vector, 
eg., aplasmid, comprising an isolatednucleic addmolecule encoding an amino 
acid biosynthetic pathway gene or genes. Suitable plasmids for these 
embodiments include pSRl and odier derivatives of pSRl (Archer, J. et ai.. J. 
15 Gen. Microbiol. 139: 1753-1759 (1993)). 

For various embodiments of the invention drawn to a method to increase 
production of an amino add, screening for increased production of an amino 
add,forexan5)le L-lysine, may bedeterminedby direcdy comparingthe amount 
of L4ysine produced in culture by a Corynebacterium spedes host strain to that 
20 of a Corynebacterium spedes transformed host strain in which an amino add 

biosynthesis gene or genes are ampUfied. The levd of production of the amino 
add of choice may conveniently be determined by the following formula to 
calculate the percent yield fixan dextrose: [(g amino acid/L / (g dextrose 
' consumed/L) ] *100. 

25 In one enibodiment, Ae invention provides a method to increase the 

production of an amino add comprising: (a) transforming a Corynebacterium 
species host cell with an isolated polynucleotide molecule comprising a 
nucleotide sequence encoding a polypeptide comprising theamino acidsequence 
of SEQ ID N0:2; (b) amplifying the number of at least one of the biosynthetic 

30 pathway genes for said amino acid in tiie chromosome of said host cell; 
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(c) selectmg a transfomied host cell; and (cQ screening for increased pioductii 
of said amino add bom said transformed host cell relative to said host cell. 









MM 



to increase the induction of an amino acid comprising transforming a 

5 Corynebacterium species host cell with an isolated polynucleotide nK)lecule 

comprising a nucleotide sequence encoding a polypeptide comprising the amino 
add sequence of SEQ ID NO:2; and further comprising at least one of the 
following: a nucldc add molecule encoding a Corynebacterium species lysine 
pathway asd amino add sequence; a nucldc add molecule encoding a 

10 Corynebacterium species lysine pathway dapA amino add sequence; a nucldc 

add molecule encoding a Corynebacterium species lysine pathway dapB amino 
add sequence; a nucldc add molecule encoding a Corynebacterium species 
lysine pathway ddh amino add sequence; a nucldc acid molecule encoding a 
Corynebacterium species lysine pathway *lysA amino add sequence; a nucldc 

IS add molecule encoding a Corynebacterium species lysine pathway lysA amino 

add sequence; and anucldc add molecule encoding a Corynebacterium species 
lysine pathway ORF2 amino add sequence. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule further comprises at least one of the following: a 

20 nucldc add molecule mcoding the asd amino add sequence of SEQ ID NO:4; 

a nucldc add molecule encoding the dapA amino add sequence of SEQ ID 
N0:6; a nucldc add molecule encoding die dapB amino acid sequence of SEQ 
IDN0:8; anucleicaddmoleculeencodingthe ddh amino add sequence of SEQ 
ID NO: 10; a nucldc add molecule encoding the HysA amino acid sequence of 

25 SEQ ID N0:2 1 ; anucleic add molecule encoding the lysA amino acid sequence 

of SEQ ID NO: 1 4; and a nucleic acid molecule aicoding the 0RF2 amino acid 
sequence of SEQ ID NO: 16. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule further conq)rises the following: anucleic acidmolecule 

30 encoding the o^^/ amino acidsequenceof SEQ ID NO:4;anucleicaddmolecule 

encoding the dapA amino acid sequence of SEQ ID NO:6; a nucldc add 
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molecule eacoding Ihe dcq^B amino add sequence of SEQ ID N0:8; and a 
nucleic acid molecule encoding the ORF2 amino acid sequence of SEQ ID 
NO: 16. 

In anotfier particular embodiment of the method, the isolated 
polynucleotide molecule further conqmses the following: anucleic addmolecule 
encoding the asd amino add sequence of SEQ ID NO:4; a nucldc add molecule 
encoding the dapA amino add sequence of SEQ ID NO:6; a nucldc add 
molecule encoding the dapB amino add sequence of SEQ ID NO: 8; a nucldc 
add molecule encoding the liictt amino add sequence ofSEQ ID NO: 10; and a 
nucldc acid molecule encoding the 0RF2 amino add sequence of SEQ ID 
NO: 16. 

In another particular embodiment of the method, the isolated 
polynucleotide molecule furtfaor comprises the following: anucleic addmolecule 
encoding Iheasd amino add sequence of SEQ IDN0:4; anucldcaddmolecule 
encoding the dapA amino add sequence of SEQ ID NO:6; a nucldc add 
molecule oticoding the dapB amino add sequence of SEQ ID NO: 8; a nucldc 
add molecule encoding the ddh amino add sequence of SEQ ID NO: 10; a 
nucldc add molecule encoding the 'fysA amino add sequence of SEQ ID 
NO:2 1 i and a nucldc add molecule encoding the ORF2 amino acid sequence of 
SEQ ID NO: 16. 

In another particular embodiment of the method, the polynucleotide 
molecule further comprises the following: a nucldc add molecule encoding the 
asd amino add sequence of SEQ ID NO: 4; a nucldc add molecule encoding ihe 
dapA amino add sequence of SEQ ID NO: 6; a nucldc add molecule encoding 
the dapB amino add sequence of SEQ ID NO: 8; a nucleic add molecule 
encoding the ddh amino add sequrace of SEQ ID NO: 10; a nucldc add 
molecule encoding the fysA amino acid sequence of SEQ ID NO: 14; and a 
nucldc acid molecule encoding the ORF2 amino acid sequence of SEQ ID 
NO: 16. 
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In another embodiment of the method, the method further conqmses 
growing said transformed host cell in a medium; and purifying an amino add 
produced by said transformed host cell. 

It is anoAer object of the invention to provide an isolated polynucleotide 

5 molecule comprising the polynucleotide molecule comprising a nucleotide 

sequence encoding the polypeptide conning the amino acid sequence of SEQ 
ID N0:2; and at least one additional Corynebacterium species lysine pathway 
gene selected from the group consisting of a'nucldc add molecule encoding an 
osrfpolypeptid^ anucldc addmolecule encoding a dapA polypeptide anucldc 

10 add molecule encoding a dapB polypeptide a nucldc acid molecule encoding 

a ddh polypeptide; a huddc add molecule encoding a 'lysA polypeptide; a 
nucldc add molecule encoding a fysA polypeptide; and a nucldc add molecule 
encodinganO/iF2polypeptide.toapreferredenibodiment,saidfljrfp^^ 
is SEQ IDN0:4; said dapA polypeptide is SEQ ID N0:6; said dapB polypeptide 

15 isSEQIDNO:8;saidrfdfcpolypeptideisSBQIDNO:10; said '/^^polypeptide 

is SEQ ID NO:21; said lysA polypeptide is SEQ ID N0:14; and said 0RF2 

polypeptide is SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 
molecule comprising Ae polynucleotide molecule comprising a nucleotide 
20 sequence encoding the polypeptide comprising the ammo add sequence of SEQ 

IDNO 2; a nucldc acid molecule encoding the osd amino add sequence of SEQ 
ID N0:4; a nudeic add molecule encoding the dapA amino add sequence of 
SEQ ID N0:6; anucldc acid molecule encoding the dopfi amino addsequence 
of SEQ ID NO:8; and a nuddc acid molecule encoding the 0RF2 amino add 

25 sequence of SEQ ID NO: 16. 

It is another obj ect of the invention to provide an isolated polynucleotide 
molecule comprising the polynucleotide molecule comprising a nucleotide 
sequence encoding the polypeptide comprising the amino acid sequence of SEQ 
ID NO: 2; a nucldc acid molecule encoding the asd amino add sequence of 

30 SEQ ID NO:4; a nucldc add molecule encoding the dapA amino acid sequence 

of SEQ ID N0:6; a nucleic add molecule encoding the dapB amino aad 
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sequence of SEQ ID N0:8; a nucleic acid molecule encoding tfie ddh amino 
add sequence of SEQ ID NO: 10; and a nucleic acid molecule encoding the 
ORF2 amino acid sequence of SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 

5 molecule comprising the polynucleotide molecule comprising a nucleotide 

sequence encoding the polypeptide comprising the amino add sequoice of SEQ 
IDN0:2; anucldc add molecule encoding the asd amino addsequence of SEQ 
ID NO:4; a nucldc acid molecule mcoding the dapA amino add sequmce of 
SEQ ID N0:6; a nucldc add molecule encoding the dapB amino add sequCTce 

10 of SEQIDNO:8; anucldc addmoleculeencodingthe amino add sequence 

of SEQ ID NO: 10; a nucldc add molecule encoding the 'fysA amino add 
sequence of SEQ ID NO:21; and a nucldc add molecule encoding the ORF2 
amino add sequence of SEQ ID NO: 16. 

It is another object of the invention to provide an isolated polynucleotide 

15 molecule comprising the polynucleotide molecule comprising a nucleotide 

sequence encoding the polypeptide comprising the amino add sequence of SEQ 
IDN0:2; anucldc addmoleculeencodingthe asd amino add sequence of SEQ 
ID N0:4; a nucldc add molecule encoding the dapA anoino acid sequence of 

4 

SEQ ID N0:6; a nucleic add molecule eacodisxg the dapB amino acid sequence 
20 of SEQ ID NO: 8; anucldc addmolecule encoding the ddh amino addsequence 

of SEQ ID NO: 10; a nucldc add molecule encoding the lysA amino add 
sequence of SEQ ID NO: 14; and a nucleic add molecule encoding the 0RF2 
amino acid sequence of SEQ ID NO: 16. 

It is aftutherobjectofthe invention to provide an isolated polynucleotide 

25 molecule comprising pK184-KDAB. It is a further object of the invention to 

provide an isolated polynucleotide molecule comprising pKl 84-KDABH*L. It 
is a further object of die invmtion to provide an isolated polynucleotide molecule 
comprising pD 1 1 -KDABHX. It is a forflier object of the invention to provide 
an isolated polynucleotide molecule comprising pD2-KDABHL. 

30 It is a further object of the invmtion to provide a vector comprising the 

isolated polynucleotide molecule comprising a nucleotide sequence encoding a 
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polypq>tide conning Ae amino acid sequence of SEQ ID NO 2; and fuilfaa- 
coniprising at least one additional Corynebacterium spedes lysine pathway gene 
selected ftaax the group consisting of a nucleic acid molecule encoding an asd 
polypqptid^ anucldc acidmolecule aicoding a dapA polypqptide; anucleic acid 
mdecule encoding a dcpB polypeptide a nucldc add molecule encoding a ddh 
polypeptide a nucldc add molecule encoding a 'lysA polypeptide; a nudeic 
addmolecule encoding a fysA polypeptide; andanucldc addmolecule encoding 

an OiiF2 polypqptide. 

It is a furdio- object to provide a host cell comprising a vector 
conqnising the isolated polynucleotide molecule comprising a nucleotide 
sequence encoding apolypqjtide comprising the amino addsequence of SEQ ID 
NO 2; and fiirdier am^ing at least one additional Corynebacterium spedes 
lysine pathw^ graie selected fiom llie group consisting of a nuddc add 
molecule encoding an a;^/polypq>tide anucldc addmolecule aicoding a dapA 
polypeptide anucldc addmolecule encoding a dapB polypqitide anucldc add 
molecule encoding a ddh iwlypeptide a nuddc add molecule encoding a 'lysA 
polypq)tide; anucldc add molecule encoding a lysA polypq)tide; and a nuddc 
add molecule aicoding an ORF2 polypeptide. 

It is a further object to provide a host cell wherein said host cell is a 
Brevibacterium selected from the group consisting of Brevibacterium flavum 
NRRL-B30218, Brevibacterium flavum NRRL-B30219, Brevibacterium 
lactofermentumlJRKLrB3Q220, Brevibacterium lactofermentum NRRL-B30221 , 
Brevibacterium lactofermentum NRRL-B30222, Brevibacterium flavum 
NRRL-30234 and Brevibacterium lactofermentum NRRL-30235. In another 
embodiment, the host cell is Escherichia coli DH5 a MCR NRRI^B30228. In 
another embodiment, the host cell is a C glutamicum selected from the group 
consisting of C glutamicum NRRL-B30236 and C glutamicum NRRL-B30237. 

The inventicm provides processes for the production of an amino acid. 
In one embodimait, the invention provides a process for producing an amino 
acid comprising: (a) transforming a Corynebacterium species host cell with an 
isolated nucleic acid molecule; (b) amplifying the number of chromosomal 
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copies of at least one of the biosynthetic pathway geaes for said amino acid; (c) 
selecting a transformedhost cell; (d) growing saidtransfonned cell in amedium; 
and (e) purifying said amino add. 

The inveation is also directed to an isolated polypeptide conprising the 
aminoacidsequenceofSEQIDNO:19. Inoneembodimentoftheinveritioniihe 

polypeptide has at least 95% sequence identity to the amino acid sequence of 
SEQ ID NO; 19. The invention is also directed to an isolated polynucleotide 
molecule con^rising anucleotide sequence encoding the polypeptide of SEQ ID 
NO: 19. In one embodiment, the isolatedpolynucleotide con^rises anucleic add 
having the sequence of SEQ ID NO: 18. 

The invention is also directed to a vector comprising the polynucleotide 
moleculecomprisinganucleotidesequenceOTCodingthepolypeptide comprising 

the amino add sequence of SEQ ID NO: 19. In one embodiment, Ae invention 
is directed to a host cell cotiqnising a vector encoding a polypqitide comprising 
the amino add sequence of SEQ ID NO: 1 9. In one embodim«it, the host cell is 
NRRL B30360. 

The invoition is also directed to a method con^rising transfomMng a 
Corynebacterium spedes host cell with the polynucleotide molecule comprising 
a nudeotide sequence encoding a polypeptide comfxising the amino add 
sequence of SEQ ID NO: 19, and selecting a transformed host cell. In one 
embodiment, the method fintilier comprises screening for increased amino add 
production. In a ptefonred embodiment, the amino add screoaed for is lysine. In 
one embodiment, the polynucleotide molecule is integrated into said host cell's 
chromosome, thereby increasing the total number of said amino addbiosynthetic 
pathway genes in said host cell chromosome. 

In another embodiment, die polynucleotide molecule further comprises 
at least one of the following: (a) a nucldc add molecule encoding a 
Corynebacterium species lysine pafliwayos^ amino acid sequence; (b) a nucldc 
add molecule encoding a Coryttebacterium species lysine pathway asd ammo 
add sequence; (c) anucleic acid molecule encoding a Corynebacterium species 
lysine pathway dapA amino acid sequence; (d) anucleic acidmolecule encoding 
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a Corynebacterium species lysine pathway dapB amino acid sequence; (e) a 
nucleic acid molecule encoding a Corynebacterium species lysine pathway ddh 
amino acid sequence; (f) a nucleic add molecule encoding a Corynebacterium 
species lysine pathway 'lysA amino acid sequence; (g) a nucleic acid molecule 
5 encoding a Corynebacterium species lysine pathway lysA amino acid sequence; 

and, (h) a nucleic acid molecule encoding an 0RF2 polypeptide having SEQ ID 
NO: 16. In this embodiment, the method further comprises screening for 
incited amino acid production. In another embodiment, the amino acid 
screened for is lysine. 

10 In another embodiment of the method, the polynucleotide molecule 

further comprises: (a) a nucleic add molecule encoding the ask amino add 
sequence having SEQ ID NO:2; (b) a nucldc add molecule mcoding a 
Corynebacterium species lysine pathway asd amino add sequence (c) a nucldc 
add molecule encoding a Corynebacterium species lysine pathway dapB amino 

1 5 add sequence; a nucldc add molecule encoding a Corynebacterium species 

lysine pathway ddh amino add sequence and, (e) a nucldc add molecule 
encoding a Corynebacterium species lysine pathway lysA amino add sequence. 
In one embodiment of this method, flie method further con^rises screening for 
increased amino add production. 

20 The invention is also directed to an isolated polypeptide comprising the 

amino add sequence of SEQ ID N0:21 . In one embodiment, the polypeptide has 
at least 95% sequence identity to flie amino add sequence of SEQ ID N0:21. 
The invention also comprises an isolated polynucleotide molecule conqnising a 
nucleotide sequence encoding the polypqptide comprising the amino add 

25 sequence having at least 95% sequence identity to the amino acid sequence of 

SEQ ID NO:21 . The invention is further comprises a polynucleotide molecule 
comprising a nucleic add having die sequence of SEQ ID NO:20, In one 
embodiment the invention comprises a vector comprising the polynucleotide 
molecule comprising anucleotide sequence encoding thepolypeptide comprising 

30 the amino acid sequence having at least 95% sequence identity to the amino add 

sequence of SEQ ID NO:21. The invention further comprises a host cell 
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comprising the vector comprising the polynucleotide molecule comprising a 
nucleotide sequence encoding flie polypeptide comprising flie amino add 
sequence having at least 95% sequence identity to the amino add sequence of 
SEQIDNa21. 

In one embodiment, the invention comprises a host cell selectedfrom 
the group consisting of NRRL B30218, NRRL B30220 andNRRL B30222. 

Hie invention is further directed to a method conqirising transforming a 
Q>ryne6^en«m spedes host ceU with a polynucleotide molecule comprising a 
nucleotide sequence encoding the polypeptide comprising die amino add 
sequence having at least 95% sequence identity to the amino acid sequence of 
SEQ ID NO: 21, and selecting a transformed host cdL Hie method further 
comprisessoeeningforincnasedaminoaddproductioniinparticular^ 

nnlvnucleotide molecule is integrated into 



said host ceU's duomosome, fliereby increasing the total number of said amino 
add biosynthetic pathway genes in said host cell dux«nosome. In one 
enAodiment the method further comprises a polynucleotide molecule further 
comprising atleast one of the following: (a) anucldc addmolecule encodinga 
Conmebacteriumspedeslysmep^y^^ ask mmo^ddscq^ 
add molecule encoding a Corynebacterium species lysine pathway ask amino 
iO add sequence having SEQ ID NO. 2; (c) a nucldc add molecule encoding a 

Corynebacteritmspecieslysincpaiim^yasd amino add sequence; (d) anucldc 
addmolecule encoding a Conmebacterium species lysinepathway dapA amino 
addsequence;(e)anucldcaddmolecdeencodingaQ,rKne6arfer«^ 
lysine pathway dapB amino acid sequence; (f) a nucldc addmolecule encoding 
25 a Corynebacterium species lysine pathway ddh amino add sequence; (g) a 

nuddcacidinoleculeencodingaCo/3«e&flrten«m5pecieslysinepat^^^ 

amino add sequence; and, (h) a nucleic add molecule encoding a truncated 
Corynebacterium species lysine pathway 0RF2 amino add sequence. In one 
embodiment, the method further comprises screening for increased amino add 
30 production. In another embodiment, the amino add screened for is lysine. 
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Another embodimmt of the invention is also directed to an isolated 
polynucleotide molecule comprising a nucleotide sequence encoding the 
polypeptide comprising the amino addsequenceofSEQ ID N0:2 .wherein the 
polynucleotide molecule further comprises a promoter sequence having SEQ ID 
NO: 17. In one embodiment, the promoter sequence has at least 95% sequence 
identity to SEQ ID NO: 17. In one embodiment, the promoter sequence having 
at least 95% sequence identity to SEQ ID NO: 17 is operably directly linked to 
the LysA gene. In anolher embodiment of the invention, Acre is a vector 
comprising the isolated polynucleotide molecule comprising a nucleotide 
> sequence encoding the polypeptide compAsingthe amino acidsequence of SEQ 

ID N0:2, wherein the polynucleotide molecule further comprises a promoter 
sequence wherein said promoter sequence has at least 95% sequence identity to 
SEQ ID NO: 17. In anofliCT aspect of the invention, there is a host cell 
comprising the vector comprising the isolated polynucleotide molecule 
5 comprising a nucleotide sequence encoding the polypeptide conqirising the 

amino acid sequence of SEQ ID N0:2, wherein the polynucleotide molecule 
further comprises a promoter sequence having at least 95% sequence identity to 
SEQ ID NO: 17. In one embodiment, the host cell is NRRL B30359. 

The invention is also directed to a method comprising ttansforming a 
20 CorynOxictenum specibs host cell with the polynucleotide molecule comprising 

a nucleotide sequence encoding fte polypeptide comprising the amino add 
sequence of SEQ ID N0:2, wherein the polynucleotide molecule further 
comprises a promoter sequence having at least 95% sequence identity to SEQ ID 
NO: 17. and selecting a transformed host cell, fa one embodiment, the method 
25 further comprises screening for increased amino acid production, fa another 

embodiment, the amino acid screened for is lysine, fa another embodiment of 
the method, the polynucleotide molecule is integrated into said host ceU's 
chromosome, thereby increasing the total number of amino acid biosynthetic 
pathway gaies in said host cell chromosome, fa anoiha- embodiment of the 
30 method, the polynucleotide molecule further comprises at least one of the 

foUowmg: (a) a nucleic acid molecule encoding a Corynebacterium species 
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lysine pa&way asd amino acid sequence; (b) a nucleic acid molecule encoding 
a Corynebacterium species lysine pathway dapA amino add sequenar. (c) a 
nucleic acidmolecule encoding a Corynebacterium species lysine palhw^ dapB 
amino add sequence; (d) a nucldc add molecule encoding a Corynebacterium 
species lysine pathw^ ddh amino add sequence (e) a nucldc add molecule 
encodingaCb/y«c6flc<en«w*paiejlysinepathwayO/^^ 
(f) anuddc addmoleculeencodingatruncated Corynebacterium species lysine 
padiway ORF2 amino add sequence; (g) a nucldc add molecule encoding a 
Coryndjacterium species lysine pathw^ lysA amino add sequence; and, (h) a 
nucldc add molecule encoding a truncated Corynebacterium species lysine 
pathway lysA amino add sequence. In this embodiment, the method further 
conqjrises screening forinaeasedaminoaddproduction; in particular, for lysine 

production. 












comprises: (a) a nucldc add molecule encodmg a Corynebacterium species 
lysine pathway asd amino add sequence; (b) a nuddc acid molecule encoding 
a Corynebacterium species lysine pathway dapA amino add sequence; (c) a 
nucldc addmolecule encoding a Corynebacterium species lysine pathway dapB 
amino add sequence (d) a nucldc add molecule encoding a Corynebacterium 
species lysine pathway ddh amino add sequence; (e) a nucldc add molecule 
encoding a Corynebacteriumspedeslysmcpz&mayORF2 aminoaddsequenc^ 
and, (f) a nucldc add molecule encoding a Corynebacterium species lysine 
pathway lysA amino add sequence. In this embodiment, the method further 
comprises screening for increased amino acid production. In a preferred 
embodiment, the amino acid is lysine. 

A variety of media known to those skilled in the art may be used to 
support cell growth for the production of an amino add. Illustrative examples 
of suitable carbon sources include, but are not limited to: carbohydrates, such as 
glucose, fructose, sucrose, starch hydrolysate, cdlulose hydrolysate and 
molasses; organic acids, such as acetic acid, propionic add, formic add, malic 
add, dtric acid, and fimiaric acid; and alcohols, such as glycerol, niustrative 
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examples of suitable nitrogen sources include, but are not limited to: ammonia, 
including ammonia gas and aqueous ammonia; anmionium salts of inorganic or 
organic acids, such as ammonium chloride, ammonium phosphate, ammonium 
sulfeteandammoniumacetate;andothernitrogen-containing sources, including 

meat extract, peptone, com steep liquor, casein hydrolysate. soybean cake 

hydrolysate, urea and yeast radract 

A variety of fermentation techniques ate known in the art which may be 
employed in processes of the invention drawn to the production of amino adds. 
Generally, amino acids may be commerciaUy produced from the invention in 
fermentation processes such as the batdi type or of the fed^atch type. In batdi 
type fermentations, all nutrients are added at the beginning of the fenmentation. 
In fedrbatch or extended fed-batch type fermentations one or a number of 
nutrients are continuously suppUedtotheculture,rightfix)mthebegimiiiig 
feraienlationoraftertheculturehasreachedacertainage,orwhenthenutrient(s) 

which ate fed were exhausted fiom the culture fluid. A variant of the extended 
batch of fed-batch type fermentation is the repeated fed-batch or fill-and-draw 
fermentation, where part of flie contents of the fermenter is removed at some 
time, for instance when the fermenter is full, while feeding of a nutrient is 
continued. In this way a fermentation can be extended for a longer time. 

Another type of fermentation, Ae continuous fermentation or chemostat 
culture, uses continuous feeding of a complete medium, while culture fluid is 
continuously or semi-continuously withdrawn in such a way that the volume of 
the broth in the fermenter remains approximately constant A continuous 
fermaitation can in prindple be maintained for an infinite time. 

In a batch fameatation an organism grows until one of the essential 
nutrients in the medium becomes exhausted, or until fermentation conditions 
become unfavorable , the pH decreases to a value inhibitory for microbial 
growth). In fed-batch fermentations measures are normally taken to maintain 
favorable growth conditions, e.g., by using pH control, and exhaustion of one or 
more essential nutrients is prevented by feeding these nutrient(s) to the culture. 
The microorganism will continue to grow, at a growth rate dictated by the rate 
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ofnmrieatfeed GeneraUy a smgle nutrient, very often the carboasoun:e,wfll 
become linuting for growth. lUe san« principle appUes for a coutmuous 
fermentation. usuaUy one nutrient in the medium feed is limiting, all other 
nutrients are in excess. THelhmting nutrient wiU be present in the culture flmd 
5 ataverylowconcentration.oftenunmeasurablylow. Diffecenttypes of nutnent 

li^tationcanbeemployed. CaAon source limitation is most often used. Other 
examples are limitation by the nitn,gen source, limitation by oxygen, limitation 
by a specific nutrient such as a vitamin or an amino add (in case the 
^croorganismis auxotrophic for such a con,K«n^. limitation by sulphur and 

10 limitation by phosphorous. 

The amino acid may be recovaed by any method known in the art 

Exemplary procedures are provided in the foUowing: Van Walsem. RJ- & 
Thompson. M.C.. J. Bioiechnol 5^M1-XZ1 (1997). and U.S. Pat No. 
3 565.951. both of which are incorporated herein by reference. 
15 ' TlieinventiondesmTjedhcreinprovidesisolatednuddcaddmolecul^ 

comprisingatleastoneUysineaminoacidbiosynthesisgene. Unless otherwise 
indicated, all nucleotide sequences described herein were detemunedusmg an 
a^mated DNA sequencer (such as fte Model 373 ftom AppUed Biosystems. 
Inc.) andallaminoaddsequencesofpolypeptidesencodedbyDNAmolecules 

20 desmbed herein were predicted by translation of the relative DNA sequence. 

Therefore, as is known in the art, for any DNA sequence detemrinedby this 
automated approach, any nucleotide sequence determined herein may contam 
some error.. Nucleotide sequences detemrined by automation are typically at 
least about 90-/0 identical, more typically at least about 95% to at least about 
25 99 9% identical to the actiial nucleotide sequence of the sequenced DNA 

molecule. The actual sequence can be more precisely detemiined by other 
approaches including manual DNA sequencing methods weU known in the art 
As is also known in the art, a single insertion or deletion in a detemuned 
nucleotide sequence compared to the actual sequence will cause aframe shift in 
30 translation of the nucleotide sequence such that the predicted amino acid 

sequence encoded by a detemuned nucleotide sequence will be completely 
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different ftom Ae amino addsequace actually encoded by the sequenced DNA 
molecule, beginning at tilie point of such an insertion or deletion. 

The invention provides several isolated nucleic add molecules mcoding 
comprising at least one L-lysine amino add biosynthesis pathway gene of 
Corynebacterium glutamicum. More spedfically, the invention provides the 
following isolated nucldc add molecules: the nucleotide sequence of the ask 
gene from the strain ATCC 21529 (SEQ ID NO: 1); the nucleotide sequence of 
the asd gene from the strain ATQC 21529 (SEQ ID N0:3)i the nucleotide 
sequence of the dqpA gene from the strain NRRL-B 1 1474 (SEQ ID N0:5 ); the 
nucleotide sequence of the dapB gene from the strain NRRL-Bl 1474 (SEQ ID 
N0:7); the nucleotide sequence of the ddh gene from the strain NRRL-Bl 1474 
(SEQ ID N0:9) and the nucleotide sequence of the ORF2 gene from the strain 
NRRL-Bl 1474 (SEQ ID NO: 15). In addition, also provided herein is the 
nucleotide sequence of lysA (SEQIDNO:13)genefrDmplafflmdpRS6 (Marcel, 
T., et aL, Molecular Microbiology 4: 1819-1830 (1990)). 

It is known in the ait that amino acids are encoded at the nucldc acid 
level by one or more codons (code degeneracy). It is also known in &e ait that 
choice of codons may influence expression of a particular amino add sequence 
(protein, polypeptide; etc.). Thus,theinvcntionisfurtherdirectedtonucldcadd 
molecules encoding the ask amino add sequence of SEQ ID N0:2 wherein the 
nucleic add molecule comprises any codon known to encode a particular amino 
add. The invention is also fiuiher directed to nucldc add sequences (SEQ ID 
NOs:l, 3, 5, 7, 9, 11, 13, 15, 18 and 20) which connprise alternative codons in 
order to optimize expression of the protein or polypeptide. 

In addition to the above described isolated nucldc add molecules, the 
invention also provides isolatednucleic addmolecules comprising more than one 
L-lysine Coryfiebacterium glutamicum biosynthesis gene. Such isolatednucldc 
add molecules are referred to as ^'cassette" constmcts. These cassette constructs 
simplify for the practitioner the number of recombinant DNA manipulations 
required to achieve gene amplification of L-lysine biosynthesis genes. 
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an isolatediiucleic add mdecule comprisiiig: (a) a polynucleotide encoding the 
Corynebacterium glutamiam L-lysine pathway ask amino acidsequence of SEQ 
ID N0:2; and (b) at least one additional Corynebacterium species L-lysine 
pathway gene selected fixm Ae group consisting of. (1) a polynucleotide 
encoding the asd polypeptide; (2) a polynucleotide encoding the di^A 
polypeptide (3) a polynucleotide encoding the dapB polypeptide; (4) a 
polynucleotide encoding the ddh polypeptide; (5) apolynucleotide encoding the 
'lysA polypeptide, and (6) a polynucleotide encoding the 0RF2 polypeptide. 

The isolated nucleic add molecules of the invention are preferably 
propagated and maintained in an appropriate nuddc add vector. Methods for 
the isolation and cloning of the isolated nucldc add molecules of the invention 
are weU known to tiiose skilled in the art of recombinant DNA technology. 
Appropriate vectors and methods for use with prokaiyotic and eukaryotic hosts 
are described by Sambrook et al. Molecular aoning: A Laboratory Manual, 
Second Edition. Cold Spring Harbor. N.Y., 1989, the disclosure of which is 
hereby inccMporated by reference. 

A great variety of vectors can be used in the invention. Such vectors 
include chromosomal, episomal and virus-derived vectors, e.g., vectors derived 
ftom bacterial plasmids and firom bacteriophage, as well as vectors derived from 
combinations thereof, such as those derived from plasmid and bacteriophage 
genetic elements, sudi as cosmids and phagemids, all may be used in accordance 
with this aspect of the present invention. Generally, any vector suitable to 
maintain and propagate a polynucleotide in a bacterial host may be used in this 
regard. 

A large numbos of suitable vectors and jH-omot^ for use in bacteria are 
known, many of which are commerdally available. Preferred prokaryotic 
vectors include plasmids sudi as those capable of replication in E. coli (such as, 
for example, pBR322, ColEl, pSClOl , pACYC 184, JtVX). Such plasmids are, 
for example, disclosed by Maniatis, T., et al, In: Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, ColdSpringHaibor,NY(1982)). 
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The following vectors are provided by way of example: pET (Novagen), pQE70, 
pQE60, pQEn9 (Qiagen), pBs, phagescript, psiX174, pBlueScript SK, pBsKS, 
pNHSa, pNH16a, pNHlSa, pNH46a (Stratagene), pTrc99A, pKK223-3, 
pKK233-3, pDR540, pRTTS (Phamiacia). . 

Prefixed vectors for the isolated nucleic acid molecules of the invention 
include the pFCl to pFC7 novel family of combinatorial cloning vectors 
(Lonsdale, D.M., et ai. Plant Molecular Biology Reporter 13: 343-345 (1995)), 
tibe pK184 vector.(Jobling, M.G. and Homes, R.K., Nucleic Acid Research 18: 
5315-5316(1990)). 

Anoflier group of preferred vectors are those that are capable of 
autonomous replication in Corynebacterium species. Such vectors are well 
known to those skilled in the art of amino acid production by way of microbial 
feraientation, exan^les of which include pSRl, pMF1014a and vectors derived 
therefrom. 

The invention provides an isolated amino acid sequence of the ask 
polypeptide of the strain ATCC 21529 (SBQ ID N0:2). The isolated as^ amino 
sequence disclosed herein possesses unique properties with respect to feedback 
resistance of ask enzyme activity to accumulated levels of L-lysine and 
L-threonine in the culture mediunt When conq)ared to the DNA sequences of 
other Coryfiebacterium glutamicum ask-asd gene sequences, Ihe invention 
discloses a threonine to isoleucine change at amino acid residue 380 which 
results in resistance to feedback inhibition. The invention also includes other 
amino acid changes at residue 380 which result in decreased ask enzyme 
sensitivity to L-threonine and/or L-lysine, 

In addition, and as described in more detail herein, the vector may 
contain control regions that regulate as well as eng^ader expression. Generally, 
such regions will operate by controlling transcription, such as inducer or 
repressor binding sites and enhancers, among others. 

Vectors of the present invention generally will include a selectable 
marker. Such marko^ also may be suitable for amplification or the vectors may 
contain additional markers for this purpose. In this regard, vectors preferably 
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contain one or more selectable marker genes to provide a phenotypic trait for 
selection of transfonned host cells. • Such markers include, but are not limited to, 
an antibioticresistance gene such as achloranqihenicol, ampicillin, or kanamydn 
resistance gene, or an autotrophic gene which allows the host cell to grow in the 

5 absence of a nutrient for which the host cell strain is normally auxotrophic. 

If the vector is intended to be maintained in die host cell 
extradbiomosomally, it will contain, in addition and origin of replication which 
will allow it to replicate in the Corynebacterium species host cell. Alternatively, 
if it is desired that the vector integrate into the Corynebacterium species 

10 chromosome, the vector is constructed such that it cannot replicate in 

Corynebacterium. For example, such a vector might be enable of propagation 
in another organism, for example, coli, but lack the proper origin of 
replication to be propagated in Corynebacterium. in anotilier aspect of this 
embodiment, the vector is a shuttle vector which can replicate and be maintained 

15 in more than one host cell species, for sample, such a shutde vector might be 

capable of replication in a Corynebacterium host cell such as a C. glutamicum 
host cell, and also in an £1 coli host cell. 

The invention further provides the following isolated tfie amino add 
sequoaces: the amino add sequence of the asd polypeptide of the strain ATCC 

20 21 529 (SEQ ID NO:4); the amino add sequence of the dapA polypeptide of the 

strain NRRL-B 11474 (SEQ ID N0:6); the amino acid sequence of the dapB 
polypeptide of the strain NRRL-B11474 (SEQ ID NO:8); the amino add 
sequence of the ddh polypeptide of the strain NRRL-Bl 1474 (SEQ ID NO: 10) 
and the amino add sequence of the ORF2 polypeptide of flie strain NRRL- 

25 B 1 1474 (SEQ ID NO: 16). In addition, also provided herein is the amino add 

sequence of lysA (pRS6) (Marcel, T., et al, Mol Microbiol 4: 819-830 (1990)) 
(SEQ ID NO: 14). 

In addition to the isolated polypeptide sequences defined by the specific 
sequence disclosures disclosed above, the invention also provides the amino add 
30 sequmces encoded by the deposited clones. 
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It will be recognized in the art diat some amino acid sequences of the 
invmtion can be varied without significant effect of Ae structure or function of 
the proteins disclosed hemn. Variants included may constitute deleticHis, 
insertions, inversions, repeats, and type substitutions so long as enzyme activity 
is not significantly affected. Guidance conconing which amino acid changes are 
likely to be phenotypically silent can be found in Bowie, J.U., ei aL, 
"Deciphering the Message in Protein Sequences: Tolerance to Amino Acid 
Substitutions," Science 2¥7: 1306-13 10 (1990), 

The strains of the invention may be prepared by any of the methods and 
techniques known and available to those skilled in Ae art Introduction of gene 
constructs of the invention into the host cell can be eflfected by electroporation, 
transduction or other methods. These mediods are described in the many 
standard laboratory manuals referenced and incorporated herein. 

Various embodiments of the invention provide strains with increased 
L-lysine production as a result of gene amplification. By gene amplification is 
meant inoeasing the number of copies above the normal single copy number of 
an L-lysine biosynfliesis pathway gene by a factor of 2, 3, 4, 5, 10, or more 
copies. 

In one embodimoit of the invention, the additional copies of the L-lysine 
biosynthesis pathway gene(s) may be integrated into the chromosome. Anotho: 
embodiment of the invention provides that the additional copies of the L-lysine 
biosynthesis pathway gene(s) are carried extra-chromosomally. Amplifications 
by a factor of 5 or less may be obtained by introducing the additional gene copies 
into the chromosome of the host strain by way of single event homologous 
recombination. Inamostpreferredembodiment,therecombinationeventresults 
in the introduction of one additional copy of the copy of the gene or genes of 
interest If more than 5 copies of the genes are desired, then the invention also 
provides for the use of multicopy plasmids canying the recombinant DNA 
constmct of the invention. 

Representative examples of appropriate hosts for isolated nucleic acid 
molecules of the invention include, but are not limited to, bacterial cells, such as 
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C. glutamicum. Escherichia coli, Streptomyces and Sabnondla typhimuritm 
cells; and fungal cells, such as yeast cells. Appropriate cultun; media and 
conditions for the above-described host cells arc known in the art. 

Particulaily prefared host cells of the invention include: 
Corynebacterium glutamicum^ Brevibacterium flavum and Brevibacterium 
lactofamenium. 

AppUaaits have deposited clmies carrying the pKl 84-KDABH'L multi- 
geneomstructs at an acceptablelntemationalDqKsitaryAuthority in accordance 
with the Budiq)est Treaty aa the International Reception of the Deposit of 
Miaoor^misnis for the Purposes of Patent Procedure. The deposits have bem 
made wifli the Agricultural Researdi Service, Culture CollectiMi (NRRL), 1815 
North University Street, Peoria, Illinois 61604. Deposits made in which Ihe 
pK184-KDAB orpK184-KDABirL multi-gene constructs have beoi integrated 
into flie diromosome of a host cell include the following: (I) the pK184-KDAB 
plasmid, integrated into Ae chromosome, deposited as NRRL-B30219 and 
NRRL -B30221 and (2) Ae pK184-KDABirL plasmid, int^rated into the 
chromosome, depositBdasNRia^B30218,NRRL-B30220,andNRRL-B30222. 

In addition, the pK1844aDABirL multigene consliuct in a plasmid 
configuration, carried in E. coli DH5a MCR, was deposited as NRRL-B30228. 
The six gene construct (pDElia2-KDABHL) was deposited in E. coli (NRRL- 
B30233). C. glutamicum conoprising pK184-KDABIfL was deposited as 
NRRL-B30236. C glutamicum comprising pKl84-KDABHL was dsposiied as 
NRRL-B30237. Brevibacterium flavum conqmsing pDElia2-KDABHL was 
deposited as NRRL-B30234. Brevibacterium lactofermentum conprising 
pDElia2-KDABHL was deposited as NRR1^B30235. 

It is an object of the invention to provide a method of producing lysine 
cominising culturing the host cells conq)rising the amino acid sequence of SEQ 
ID N0:2 wh«cin said host cells conpise one or more of: (a) increased «izyme 
activity of one or more lysine biosyntfaetic pathway enzymes compared to the 
genetically unaltered host cell; (b) (me or more cqiies of each gaie encoding a 
lysine biosynthetic pathway enzyme; and, (c) alteration of one or more 
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transcriptioii fectois r^atiiig transcription of one or more genes encoding a 
lysine biosynthetic pathway enzyme, wherein said host cell produces lysine in 
said culture medium. Ih one embodiment ofthe method, said increased enzyme 
activity comprises overexprcssing one or more genes encoding one or more 
lysine biosynthetic palhway enzymes. In one embodiment ofthe method, said 
one or more genes are operably linked directly or indirecdy to one or more 
promoter sequences. In another embodiment of themethod, said operably linked 

promoter 

sequences are heterologous, endogenous, or hybrid In a preferred embodiment 
ofthe method, said promoter sequences are one or mcMie of: apromoter sequence 
ftom the 5' end of genes endogenous to C. glutamicum, a promoter sequence 
fiom plasmids that repUcate in C. glutamicum, and, a promoter sequence from 
the genome of phage which infect C. glutamicum. In a preferred embodiment of 
flie method, one or more of said promoter sequences arc modified. In anoAer 
preferred embodiment, said modification comprises truncation at the 5' end, 
truncation at the 3' end, non-terminal insertion of one or more nucleotides, non- 
terminal deletion of one or more nucleotides, addition of one ormore nucleotides 
at the S* end, addition of one ormore nucleotides at the 3' end, and, combinations 
dioeof. 

In another embodiment of the method, said increased enzyme activity 
results ftOTi the activity of one or more modified lysine biosynthetic pathway 
enzymes wherein said enzyme modification results in a change in kinetic 
parametas, allosteric regulation, or both, compared to tiie enzyme laddng the 
modification. M one embodiment of the metho4 said change in kinetic 
parameters is a change in or botti. In anoflier embodiment of the 

method, said change in allosteric regulation is a change in one or more enzyme 
allosteric regulatory sites. In one embodiment, said diange in aliostenc 
regulation is a change in the affinity of one or more enzyme allosteric regulatory 
sites for the ligand or Ugaads. The ligands may be the same or different In one 
embodiment, said enzyme modification is a result of a change in the nucleotide 
sequence encoding said emzyme. In one embodiment, said change in said 
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nucleotide sequence is an addition, insertion, deletion, substitution, or a 
combination tiiereof, of one or more nucleotides. 

In another embodiment of flie method, said alteration of one or more 
transcription factors comprises one or more mutations in transcription inhibitor 
5 proteins, one or more mutations in transcription activator proteins, or boA, 

wherein said one or more mutations increases transcription of the target 
nucleotide sequence compared to the transcription by said one or more 
transoription factors lacking said alteration. In one embodinaent, said one or more 
mutations is a change in said nucleotide sequence encoding said transcription 
10 factor. M anoth^ embodimmt, said change in said nucleotide sequence is an 

addition, insertion, deletion, substitution, or a combination thereof, of one or 
more nucleotides. 

All patents and publications referred to herein are C3q)ressly incorporated 
by reference in tiieir entirety. 



15 Examples 

Example 1 

Preparation ofL-Lysine Pathway Multi-gene Constructs 
pK184-KDAB and pK184-KDABH'L 

Applicants have created Hy sine amino acid biosynthetic pathway multi- 
20 gene constructs for the pmpose of amplifying the number of one or more of the 

genes of this pathway in the chromosome of Corynebacterium species. Also, 
through careful study of the L-lysine biosynthesis genes of strain ATCC 21 529, 
Applicants have identified an amino acid change of threonine to isoleucine at 
amino add residue 380 of tiie ask gene of ATCC 21 529. Compared to the DNA 
25 sequences of other Corynebacterium glutamicum ask genes, a threonine to 

isoleucine change at amino acid residue 380 was observed (Figure 19), which is 
responsible for die unusual feedback resistant properties with respect to aspartate 
kinase enzyme regulation. 
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The isolated nucleic acid molecules encoding L-lysine, amino acid 
biosynthesis pathway genes utilized in the presait invention are bom the 
following sources: 



Gene(s) 



Source 



10 



askrosd 

dapA 

dapB 

ddh 

lysA 



lysA 
lysA 
ORF2 



Strain ATCC 21529; 
Strain NRRLBl 1474 
Strain NRRLBl 1474 
StrainNRRLB11474 



PlasniidpRS6 (Marcel, T., etaU MoL Microbiol 4: 819- 
830 (1990)) carrying the lysA gene isolated from strain 
AS019, which was derived from ATCC 13059; 

NRRLB11474; 

NRRL Bl 1474 (full lengdi); and. 
Strain NRRLBl 1474. 



15 



20 



25 



As one skilled in the art would know, the invention is not limited to the 
specific strain origins that Applicants present for the isolated nucleic acid 
molecules of the invention. Any strain of Corynebacterium species, particulariy 
that of Corynebacteritan glutamiam, may be utilized for the isolation of nucleic 
acid molecules that will be used to amplify the number of chromosomally located 
amino add biosynthetic pathway genes. Particularly preferred strains include: 
NRRL-Bl 1474, ATCC 21799, ATCC 21529, ATCC 21543, andB12. 

Methods and techniques common to the art of recombinant DNA 
technology were used in making the multi-gene constmcts of the invention, as 
may be found in the many laboratory manuals dted and incorporated herein,for 
example as found in J. Sambrook, E.F. Fritsch and T. Maniatis, Molecular 
Cloning: A Laboratory Manual, 2d ed. Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York (1989). 

The polymerase chain reaction (PCR) technique is \ised extensively in the 
making of the multi-gene constmcts of the invention. In a typical reaction, the 



wo 01/49854 



PCT/USOO/35617 



-47- 

standanl lOXstocksolutioD (100 mM Tris-HCL, pH 8.3, 500 mM KCL, 1 .5 mM 
MgQz) is diluted to IX for use. Typical reaction conditions were used for PCR 
amplication: 10 mM Tris, pH 8,3 , 50 mM KQ, 1.5 mM MgClj, 0.01% gelatin, 
200 ijM deoxynucleotides, 0.2-1 .0 fuM primers and 2.5 U/1 00/zl pfii polymoase. 
Standard cycling parameters were also employed in PCR reactions: For 30 
cycles, ten^late denaturation was performed at 94 **C for 1 min; 55 ^^C annealing 
temperature was performed for 1 min (or annealing temperature appropriate for 
particular primer pair); product extension was performed at 72 *C for 1 min (if 
product is <500 bp), 3 min (if product is >500 bp); and at the end of cycling, a 
final extension at 72''C for 7 min was performed. 

The primers utilized for cloning experiments included: 
ash 5'-GGGTACCTCGCGAAGTAGCACCTGTCAC-3'; 
asd: 5'-GCGGATCCCCCATCGCCCCTCAAAGA-3'; 
dapB: 5*-AACGGGCGGTGAAGGGCAACT-3'; 
dapA: 5'-TGAAAGACAGGGGTATCCAGA-3'; 
ddh 5*-CCATGGTACCAAGTGCGTGGCGAG-3'; 

5'-CCATGGTACCAC:ACTGTTTCCTTGC-3'; 
argS: 5'-CTGGTTCCGGCGAGTGGAGCCGACCATTCCGCGAGG-3'; and 
lysA: 5'-CTCGCTCCGGCGAGGTCGGAGGCAACTTCTGCGACG-3*, a 
primer that anneals internally to lysA (about 500bp upstream to the end of lysA). 
XysA is a truncated form obtained from lysA. 

Applicants utilized standard PCR and subcloning procedures in cloning 
the coding regions of asknisd, dapB-ORFI-dapAy ddhy 'lysA, and lysA. 
Construction procedures and intermediate plasmids are described in Figure 18. 
Applicants performed the following steps (Figure 18) in constmcting the 
following vectors used in the L-lysine biosynthetic pathway: 

1. pGEMT-ask-asd: an approximately 2.6 Kb PCR product 
containing the askrosd operon of ATCC21529 using primers ask and asd was 
cloned into pGEM-T (Promega pGBM-T vector systems); 
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2. pADM21: an approxiinately 13Kb PGR product (with an 
engineered Kjml site on both prim^) ofNRRL-B 1 1 474 ddh coding region was 
cloned into pADM20; 

3. pUC 1 %-ddh\ an approximately 1.3Kb Kpnl fragment of 
pADM21 containing ddh (NRRL-B11474) was subcloned into pUC 18 at tibe 
Kjmlsite; 

4. pLIC l.7-argS-7>«i4: PCRproductusingtemplateNRRL-Bl 1474 
genomic DNA and primers argS and lysA was cloned into pPMG-LIC cloning 
vector (PharMingen); 

5. pM4-dapB'ORF2'dapA.: an approximately 3 Kb PGR product 
using primers dapB and dapA was cloned into pM4 at &e Xbal site; 

6. pFCi-ask-asd: an approximately 2.6 Kb Nsil-Apal fragment of 
pGEMI-ash^isd was cloned into pFG3 cut with PstI and Apal; 

7. pFGl-Jd/i: "-13 Kb SaU-EcoRI fragment of pUC18-£WA was 
cloned into pFCl cut with Sail and EcoRI; 

8. pFC\-ddh-fysA: an approximately 1.5 Kb EcoRI fragment 
(containing the truncated fysA DNA) of plACU-argS-'lysA was cloned into 
pFCl -ddh at the EcoRI sit^ 

9. pFCS-^i?-^i{F^-<i^:an^>proxiniately3.4KbBamHI-BgIII 
fragment of pM.4-dapB-ORF2-deipA was cloned into pFCS at die BamHI sit^ 

10. pFC5-diy)B-ORF2-dapA-ddh-l)^A: -2.8 Kb Nhel fragment of 
pFCl-ddh-'lysA was cloned into pFC5-dapB-ORF2-dapA at the Nhel sit^ 

11. pFC-3-ask-asd-dapB-ORF2-dapA-ddh-'fysA: -6.2 Kb NrtI 
fragment of pFC5-dapB-ORF2-dapA-ddlt-lysA was cloned into pFd-ask-csd 
at the NotI site; 

12. pDBlia9-ask-asd-dapB-ORF2-dapA-ddh-lysA (pDElia9- 
KDABITL): -8.8 Kb Pmelfragmait ofpFCi-ask-asd-dapB-ORF2-d{q)A-ddh- 
'lysA was cloned into pDElia9 at the EcoRV site; and 

13. pYi\U-ask-asddapB-0RF2-dapA-ddh-HysA (pK184-KDABH'L): 
an approximately 8.8 Kb Pmel fragment of pFC3-ask-cisd-dapB-ORF2-dapA- 
ddh-lysA was cloned into pK184 at the Hindi or Smal site. 
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14. pT?C5Hisk<isd-dapB'ORF2HiapA (pFC5-KDAB): -2.6 KbKpnl- 
Smal fragment of pFC3-ask-asd was cloned into pFCS-dapB-ORFZ-^pA cut 

with Kpnl and Snoal. 

1 5. ^Yim^h<vidrdapB'ORF2'dapA (pKI84-KDAB): -7 Kb Kpn^ 
Pmd fragment of i^VCS-askrOsd-dapB-ORFl-dapA was cloned into pK184 cut 

with Kpnl andHmcIL 

Thus, Applicants havemadelfae following L-lysinemulti-gme constructs: 

1. pK184-KI>ABHX, wherein "'K^'iepresentsanucleotidesequen 
encodingthe a5i!:polypq)tide; '^D^representsanucIeotidesequenceencodiDg&e 
asd polypeptide; "A** represents a nucleotide sequoice encoding the dapA 
polypeptide; "B" represents a nucleotide sequrace encoding the dapB 
polypeptide; "H'" represents a nucleotide sequence encoding flie ddh 
polypq>tid^ and"*L" represoits anucleotidesequence encodingpart of the 'lysA 
polypq}tide. This construct is rcfeired to as a truncated 6 gene construct The 
pKlS4-KDABHL construct, constructed infra, is referred to. as a full length 6 
gene construct 

2. pK184-KDAB, wherein ''K'' represents a nucleotide sequence 
encodingthe asitpolypq)tide;''D''representsanucleotidesequmce encodingthe 
asd polypq>tide; '^A^ represents a nucleotide sequence encoding the dapA 
polypeptide; and "^B** represents a nucleotide sequence encoding the dapB 
polypq)tide. This construct is refenred to as a 4 gene construct 

Both pK184-KDABirL and pK184-KDAB, as do the other constructs 
discussed herein, comprise the nucleotide sequence encoding Xbt ORF2 
polypeptide. 

It should be noted that in addition to the indicated polypeptide sequences 
encoded by the isolated nucleic add sequences rqnesented by *1C*, "D", "A**, 
"B," "L" and "*L", these isolated nucleic add sequences also include native 
promoter elements for the operons represented therein. Thus, the ask-asd 
sequences have been cloned in a fashion that includes the respective native 
promoter elements; the dapA mddapB sequences, representing the operon dapB- 
0RF2-dapA, have been cloned in a fashion that includes therespecti ve promoter 
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elements; the ddh sequence has beoi cloned in a fashion that includes the 
respective native promoter elements, and the lysA and 'lysA sequences have been 
cloned in a fashion that includes a native promoter element 

Alternative gene promoter elements may be utilized in the constmcts of 
the invoition. For example, known bacterial promoters suitable for this use in 
the present invention include die E, coli lad and /acZ promoters, the T3 and 77 
promoters, the gpt promoter, the lambda PR and PL promoters, Ae trp promoter, 
or promota:s endogenous to the bacterial cells of the present invention. Other 
promoters useful in the invention include rpgulated promoters, unregulated 
promoters and heterologous promoters. Many such promoters are known to one 
of skill in the art. See Sambrook, E.F. etal. Molecular Cloning: A Laboratory 
Manual^ 2d ed. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York (1989). 

Example 2 

Two-Fold Amplification of L4ysine Amino Add Biosyntitesis 

Pathway Genes 

For exen^lary purposes only. Applicants provide herein an example 
wherein at least one L-lysine amino acid biosynthesis pathway gene is amplified 
by a factor of 2 by way of (a) the introduction of an isolated nucleic acid 
molecule into a Corynebacterium glutamicum host cell, and (b) the subsequent 
single crossover homologous recombination event introducing said isolated 
nucleic acid molecule into said Coiynebacterium glutamicum host cell 
chromosome. 

As will be understood by those in the art, at least one or two or three or 
four or five or six or seven or eight or nine or ten or more amino acid 
biosynthesis pathway genes may be anQ)lified, /.e., increased in niunber, by a 
factor of at least one or two or three or four or five or six or seven or eight or 
nine or ten fold with minor variations of the example presented hcrem. 
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pK184-KDAB, pK184.KDABH'L and pD2-KDABHL(a fuU length 6 
g&ic construct constracted in Exaniple 4) plasmids were used in the constructicm 
of high yield derivative cell lines of the invmtion. This was accomplished by 
way of introducing plasmid pK184-KDAB, pK184-KDABirL and pD2- 
5 KDABHL DMAs into a Corynebacterium species resulting in incorporation of 

pK184-KDAB, pK184-KDABIfL or pD2-KDABHL into the host cell 
chromosome via a single crossover homologous recombination event 
An^lification of the amino acid biosynthetic pafliway genes by way of 
chromosomal integration of the plasmid constmcts of the invention provided 
10 increased I^lysine production in several Coryftebacterium species strains. 

« 

For cell transformation experiments wi& the isolated nucleic acid 
molecules of the invention, the growth and preparation of competent cells may 
be done according to the following procedure: (1) piddng a &esh, single colony 
of Corynebacterium glutamicum and growing a culture overnight in 10 mL CM 

IS (SMI) in a 250 mL shake flask at 30 degrees Celsius with agitation; 

(2) inoculating 200 mL of "Growth Media** with the overnight culture to an 
optical density (O.D.) of 660 nm of 0. 1 in a 500 mL shake flask; (3) growing the 
culture at 30 degrees Celsius with agitation for 5-6 hours; (4) pouring the culture 
into a chilled, sealed, sterile 250 mL centrifuge bottle; Spin at 8-1 OK for ten 

20 minutes in Refrigerated Sorvall at 4 degrees Celsius; (5) pouring off the 

supernatant thorou^y and resuspending the cell pellet in an equal volume of 
ice-cold, sterile, deionized water, (6) centrifiiging the sample again under the 
same conditions; (7) repeating the water wash remembering to keep everything 
ice-cold; (8) pouring off the supernatant thoroughly and resuspending the cell 

25 pellet in 1 mL of ice-cold, sterile 10% glycerol and transferring the cells to a 

chilled, sterile, 1.5 mL microcentrifuge tube; (9) spin flie sample for 10 minutes 
in arefrigerated centrifuge; (1 0) pipetting off and discarding the supernatant, and 
resuspending the pellet in two to three times the pellet volume (200-400 jiL) of 
10% glycerol; and (1 1) alliquoting, if necessary, the cells into chilled tubes and 

30 freezing at -70 Celsius. 
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pK184-KDAB, pK184-KDABifL and pD2-KDABHL plasmid DNAs 
were introduced into Corynebacteriim gliUamicum h<^ ceih hy Ac 
electroporaticHi piocedure: ( 1 ) pipetting 3 S |jd^ cell/glycerol solution onto the side 
wall of a chilled 0. 1 on electrocuvette; (2) pipetting about 2-4 (iL of plasmid into 
5 Ike solution andmixing the sample by gentlepipetting up and down; (3) bringing 

the entire solution to the bottom of the electrocuvette by gentle tapping, avoiding 
the creation of bubbles; (4) keeping the sample on ice until ready for die 
electroshodc step, wiping off any moisture on the outside of the electrocuvette 
priix to the electroshock administration, and shocking the cells one time at 

10 1.5kV,200£2,25fiF. 

Cells are allowed to recover from electroporation by: (1) immediately 
pipetting 1 mL of warm ""Recovery Media" into the electrocuvette and 
thoroughly mixing the solution by pipetting; (2) incubating the solution (in the 
electrocuvette) at 30 degrees Celsius for at least three hours for antibiotic 

IS resistance expression and cell recovery and (3) plating on selection media and 

incubating at 30 degrees Celsius for 3 days. 

Example 3 

Screening and Selection of Strains with Improved Ir-Lysine Production 

20 After 3 days of growdi, single colonies of antibiotic resistant cells are 

individually selected to determine if there is increased L-lysiae production over 
that which is produced by the parental host cell strain. 

Recipes for all media used in these experiments are found in Tables 1 
and 2. L-lysine production is determined on cultures of transfomied, antibiotic 

25 resistant cells grown in shaker flasks. Briefly, seed media (Table 1), was 

dispensed in 20ml aliquots into deep baffled 2S0ml Bellco shake flasks and 
autoclaved for 20 minutes. After cooling to room temperature, these seed flasks 
were then inoculated with the strain to be tested and placed on a rotaiy shaker. 
They were incubated at 30 degrees Celsius, shaking, overnight. The following 

30 morning, the optical density (wavelength = 660nm) of each seed was recorded. 
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and 2ml of the culture from each seed flask was transferred to all ml aliquot of 
FM3 media, also in a deep baffled shake flask. These ''main** flasks Vfcro thai 
returned to the shakor and incubated at 30 degrees Celsius. 

After 48 hours of incubation, 1 ntil of main culture was removed from 
eac^ flask, and the flasks were pronq)tIy returned to the shaker. Fiomthelml 
san[q)le, optical density was detemiined by diluting 1 :50 in 0. IN HQ to dissolve 
the calcium carbonate present in the media. The remainder of each san^)le was 
then centrifriged to pellet cells and calcium carbonate. A 1:50 dilution of flie 
supernatant was made in water and from this dilution the dextrose omcentration 
was determined. Extracellular Lr-lysine concentrations were also determined at 
this time by HPLC 

High yield derivative cells may be conveniently identifiedby determining 
the percent yield from dextrose, the yield of amino acid from dextrose, 
defined by the formula [(g amino acid produced/ g dextrose consumecQ^ 100] = 
% yield. Results are presented below in which the parental strains B12, NRRL- 
B 1 1474 and ATCC 21799 are transforaied with the L-lysine multi-gene isolated 
nucleic acid molecules of the invention identified as pK184-KDA, 
pK184-KDABHXandpD(Elia)2-KDABHL. ThepD2-KDABHLcQnstmctwas 
made as in Example 4. 



lysine titer L-lyslneyidd Cell Deposit 



strain Tested 




(%) 




NRRL-B 11474 


31 


44 




NRRL-B11474::pK184-KDAB 


32 


45.7 


NRRL-B-30219 


NRRL-Bl 1474:.-pK184-KDABH'L 


36 


51.8 


NRRL-B-30218 


NRRL-B11474:q)DElia2-KDABHL 


38 


54.6 


NRRL-B-30234 


B12 


1.4 


0.9 




E12:.-pK184-KDABirL 


26.8 


38 


NRRL-B-30236 


E12:TDElia2-KDABHL 


29.8 


42.5 


NRRI^B-30237 


ATCC21799 


26.8 


36.9 




ATCC21799:: pK184-KDAB 


28.5 


39 


NRRL-B-30221 


ATCC21799:: pK184-KDABH'L 


31 


43 


NRRL-B-30220 


ATCC21799:: pDElial-KDABHL 


36 


50 


NRRL-B-30235 
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Once high yield derivative cell lines are idmtified, the cell lines are 
further screened to determine fbst amplification of the amino add biosynthetic 
pathway genes has occurred. Aiiq;)lification screening may be conveniently 
accomplished either by (1) standard southern blot methodology to determine 
gene copy number or (2) by a drteraoinatton of the total enzyme activity for 
enzymes encoded by the respective biosynthetic pathway genes of the isolated 
nucleic add molecule introduced into the host cell. 

A determination of gene copy number by Souttionblot methodology may 
be done utilizing standard procedures known in the art of recombinant DNA 
technology, as described in the laboratory manuals referenced and incorporated 
herein, for example as found in J. Sambiook, E.F. Fritsch and T. Maniatis, 
Molecular Cloning: A Laboratory Manual^ 2d ed.. Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New Ycok (1989). 



15 



20 



25 



Table 1. Seed Media, SMI 


Ingredient 


Concentration (g/L) 


Sucrose 


50 


Potassium Phosphate, Monobasic 


0.5 


Potassium Phosphate, Dibasic 


1.5 


Urea 


3.0 


Magnesium Sulfate 


5.0 X 10-' 


Polypeptone 


20 


Beef&Ltract 


5.0 


Biotin 


7.56 X la* 


Thiamine 


3.0x10"' 


Niacinamide 


1.25 X 10r» 


Lr-Metfaionine 


5.0 x. lO-' 


L^Threbnine 


2.5 X. la* 


L-Alanine 


5.0x10-' 


pH 


7.3 
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Table 2. Main Media, FM3 


Ingredient 


Concentration (jg/L) 


Dextrose* 


60 


Annnotdum Sulfate 


50 


Potassium Phoq)hate, Monobasic 


1.0 


Magnesium Sulfate 


4.0 X IQr^ 


Manganese Sulfate 


1.0x10-^ 


Ferrous Sul&te 


1.0 xia^ 


Biotin 


3.0x10^ 


Calcium Carbonate 


50 


Com Steq> Liquor (dissolved solids) 


20 


pH (adjusted widi KOH) 


7.4 



^Dextrose was added after autodaving 



Example 4 

Preparation of L4Lysine Pathway Multi-Gene Construds 

Theinventionfurflier comprises additioiialL-lysinemulti-geae constructs 
constructed using the PCR technique. Standard PCR and subcloning procedures 
were utilized, as described above, to generate 5-gene constructs similar to those 
in Example 1 . The constructs of this example comprise the antibiotic resistance 
gene, chloraniphenicol acyl transferase (CAT). The CAT gene was operably 
linked to a Corynebacteria phosphofructokinase promoter for expression in 
Corynebacteria. 

The following steps were performed in constructing the following 
constmcts containing the CAT gene: 

1. pGEMT-os^-ajJ: -2.6 Kb PCR product containing the ask-asd 
operon of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems); 

2. pUClSndtfA: --1.3Kb Kpnl fragment of pADM21 containing ddfi 
(NRRL Bl 1474) was subcloned into pUC18 at the Kpnl site; 

3. pLICl J-argS-HysA: -3Kb PCR product using template BFIOO 
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genomic DNA and primers argS and lysA was cloned into pPMG-UC cloning 

vector (PharMingen); 

4. pM4-dapB-ORF2'dapA: -3 Kb PCR product using primers dapB 
saddapA was cloned into pM4 at die blunted Xbal site; 
5 5. pPC3^k-asd: -2.6 Kb Nsil-Apal fragment of jiGEMT-ask-asd 

was cloned into pFC3 cut with PstI and Apal; 

6. pFCl-ddh: -1.3 Kb SaU-BcoRI fragment ofpUClS^iwas 
cloned into pFCl cut with Sail and BcoRI; 

7. pFCl-ddh-lysA: -1.5 Kb EcoRI fragmrat (containing the 
10 tnjncated lysA DNA) ofpLlClJ-argS-HysA was cloned into pFCl-ddh at flie 

EcoRI site; 

8. pFCl-ddh-lysA: -2.1 Kb EcoRI-Pstl fragment (containing the 
intact lysA DNA) of pRS6 was cloned into pFCl-drfA cut with EcoRI and PstI; 

9. pFC5'dapB'ORF2'dapA: -3.4 Kb Bamffl-BglH fragment of 
1 5 pM4^pB-ORF2-4apA was cloned into pFC5 at the BamHI sit^ 

10. pFC5^pB-ORF2'dapA'<Idh-lysA: -2.8 Kb Nhel fragment of 
pFCl-ddh-lysA was cloned into p¥C5-dapB'ORF2'dapA at the Nhel site; 

1 1 . pFC5-dapB'ORF2-dapA-ddh-lysA: -3.4 Kb Nhel fragment of 
pFClnidh'lysA was cloned into pFC5-dapB-ORF2-dapA at the Nhel site; 

20 12. pFC3'mk'asd'dapB'ORF2-dapA-ddh-'lysA (pFC3-KDABH'L): 

-6.2 Kb NotI fragment of i^FC5'dapB-ORF2-dapA-ddh'lysA was cloned into 

pFd-askrosd at the NotI site; 

13. pFC3-ask^d'dapB-ORF2-dapA-ddh4ysA (pFC3-KDABHL): 
-6.8 Kb NotI fragment of pFC5-dapB''ORF2'dapA-ddh-lysA was cloned into 

25 pFCi-ask-asd at the NotI site; 

14. pKlM^k^d-dapB-0RF2-dapAM-Hy^^ (pK184.KDABH'L): 
-8.8 Kb Pmel fragment of ipFC2>-ask'<isd^apB-ORF2^apA -ddh- 'lysA was 
cloned into pK184 at the HincII or Smal site; 
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15. ^l>mibL-<isk-asd-dapB<>RF2'dapA'ddh4ysA (pD2-KDABHL): 
-9.4 Kb Pmd fragmoit of ^VCi-ask-asd-dapB-ORF2-dapA-ddh-lysA was 
cloned into pDElia2 at the HincII site (contaiiis the km gcos; is a fiill length 6 
gcDG construct); 

16. pDBUan-ask-asd-dapB-ORF2-dapA-ddh-'lysA (pDl 1- 
KDABITL): -8.8 Kb Pmel fragment of pFC3-ask^d-dapB-ORF2-dapA-ddh- 
'fysA was cloned into pDEIial 1 at the HincII or Smal site (contains tibie CAT 
geasi is a truncated 6 gene construct); 

17. pDEIial \-askrasd-dq;>B-ORF2-dapA-ddh-fysA (pDl 1-KDABHL): 
-9.4 Kb Pmel fragment of pFC3-ask-asd-dapB-ORF2-dcpA-ddh4ysA was 

cloned into pDEIial 1 at the HincH site (contains the C4r gene; is a frill length 
6 gene construct); 

1 8. pDElia2: ~1 .24Kb blunted PstI fragment of pUC4K ligated with 
the ~1.75Kb Dral-Sspl fi:agmait of pUC 19; 

19. pDEIial 1: -IKb PGR product containing the dilQraQq)henicol 
acyl-transferase gene eiqpressed by the C glutamiam fda promoter was obtained 
using primers UCdral andUCsspI andpM4 as tenq>late and was ligated with the 
~1.75Kb Dral-Sspl fragment of pUC19; 

The primers utilized for the cloning procedures included: 
ask 5'-GGGTACCTCGCGAAGTAGCACCTGTCAC-3' 
asd: 5'-GCGGATCCCCCATCGCCCCTCAAAGA-3' 

* 

dapB: 5'-AACGGGCGGTGAAGGGCAACT-3' 
dapA: 5'-TGAAAGACAGGGGTATCCAGA-3' 

ddh\ 5'-CCA TGGTACCA AGTGCGTGGCGAG-3' 

ddhl 5'-(XA TGGTACCA CACTGTTTCCTrGC-3' Kpn I sites:GGTACC 



argS: 5'-CTGGTTCCGGCGAGTGGAGCCGACCATTCCGCGAGG-3' 
lysA: 5'-CTCGCTCCGGCGAGGTCGGAGGCAACTTCTGCGACG-3' 
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a primer that anneals internally to lysA (about 500bp iqpstream to the end of 
lysA). 

UCdral 5*-GGATCTTCACCTAGATCC 
UCsspiy-CCCTGATAAATGCTTC 

«K-, "D'', "A**, "B," "H,'' "L" and "*L" have Ac same designations as set 
forth above. 

Example 5 

Three-Fold Amplification oflAysine Amino Add Biosynthesis 

PathfPoy Genes 

For exffliplaiy purposes only. Applicants provide herdn an example 
wherein at least one L-lysine amino acid biosynthesis pathway goie is amplified 
by a factor of 3. 

Plasmid pDl l-KDABITL (constructed in Example 4) was used in the 
construction of high yield derivative cell lines of the invention. For cell 
transfontnation experiments with the isolated nucleic acid molecules of the 
invention, ike growth preparation of competent cells, and determining of relative 
growth may be done according to the procedure set forth above. 

Plasmid pDl l-KDABITL DNA was introduced into NRRL-B30220 
(comprising pK184-KDABH*L), using the electroporadon method above. 
Introduction of thepDl l-KDABITL plasmid DNA into NRRL-B30220 resulted 
in incorporation of one copy of pDl l-KDABITL into the host cell chromosome 
viaasingle crossover homologous recombination event The host cell comprising 
two copies of five genes (pDl l-KDABITL and pK184-KDABH'L) has been 
deposited as NRRL-B30222. 

The amount of lysine produced by C. glutamicum ATCC 2 1 799 host cells 
having 3 copies of 5 genes (one endogenous copyand one copy of each of pDl 1 - 
KDABITL and pKl84-KDABH'L) is shown below. 
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Lr-lysine Production 



Strains 



L-lysine titer (g/L) L-Iysine yidd (%) 



ATCC 21799 



26,6 45.0 



NRRL-B30222 



32.0 



56.0 



Example 6 



This exan^le describes changing the promoter to increase the level of 
expression of each of these 6 genes desoibed above. Six genes encoding six 
different enyzmes of the biosynihetic pathway from L^asp^tate to L-lysine have 
beea inserted onto the chromosome of Corynebacterittm glutamicum. The 
additional copy of each gene is firom a C glutamicum strain. The nucleotide 
sequences that regulate the level of expression (promoter) for each gene were die 
same as found on the C. glutamicum chromosome at tiie native loci. 

Increased expression can result in increased specific activities of the 
en:grmes and improved flux of caibon from aspartate to lysine. The yield of 
lysine from glucose can be improved by this technique. 

The level of expression from a promoter sequence is referred to as 
strength. A strong promoter gives higjbier expression than a weak one. The 
mechanisms that determine the strength of a promoter have been described 
(Record, M.T., et aL, "Escherichia coli RNA Polymerase, Promotes, and the 
Kinoes of the Steps of Transcription Initiation/' in Escherichia coli and 
Salmonella: Cellular afui Molecular Biology, ASM Press (1996), pp. 792-881). 
Sources of promoters include nucleotide sequences from the 5* end of genes 
native to die C glutamicum chromosome, from sequences on plasmids that 
replicate in C. glutamicum, from sequences in the genome of phage that infect 
C. glutamicum^ or from sequences assembled by humans (tac, trc) and are not 
found in nature. Genes of ribosomal proteins, ribosomal RNAs and elongation 
factors show high levels of expression. The promoters of these genes, are 
candidates for increasing expression of amino acidbiosynthetic pathway genes. 
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Anotheri«s<mfordiangingpromotOT of genes inbiosynth^^ 
is to make the pathway independent of factors fliat control the pathway in the 
wild type organism. For cxanq)le the native promoter oftheopcron that contains 
diaminopimelate decaiboxylase of the lysine biosyntiietic pathway of C 
glutamicum can respond to arginine or lysine in the growth medium. Aigmine 
increased transcription three-fold and lysine decreased transcription by one third 
(Qeaizsi,etaL,JBact 775:7356-7362(1993)). Diaminopimelate decaiboxylase 
activity decreased 60% in cells grown in minimal medium supplemented with 
lOmmM lysine (Cremer et aU J Gen Microbiol 73-^:3221-3229 (1988)). 
Replacing die promoter of lysA which encodes the diaminopimelate 
decarboxylase is one way to make lysine biosynthesis independent of arginine 
and lysine levels in media. 



Example 6A 



15 



Shown below are ecainplesofpromoters that are strongCTthan the askPl 
promoter which regulates the gene for aspartate kinase, the first enzyme in die 
pathway from aspartate to lysine. 



Beta-Galactosidase Assay of Candidate Promoters 



20 



25 



30 



Candidate 


Specific Activity 
micromolAuin/ing 


Origin 


E12 


0.20 


no promoter 


E12^TAC 


49.80 


pKK223-3 


BFIOO 


0.08 


no promoter 


BFlOO^ADlSl.l 


2.22 


aspartoldnase PI 


E12 


0.11 


no promoter 


E124)AD15L1 


1.96 


a^aitokmase PI 


E12/5 


3.46 


BFIOO genome 


E12/7 


.8.60 


BFIOO genome 


E12/I0 


6.56 


BFIOO genome 


E12/32 


3.11 


BFIOO genome 


E12/3 


22.00 


coiynq)hage 


E1239 


11.57 


coiynephage 
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E12 is a C glutamicum strain that does not produce lysine. E12 is a laboratoiy 
strain dmved from ATCC 13059. BFIOO is ahigh level lysine producer (NRRL- 
Bl 1474). TAC is commercially available promoter that has beoi used as an 
example of a strong promoter. Four promoters fi:om the C glutamicum 
chromosome and three from a phage have been identified that are stronger ihm 
the native aspartokinase promoter. 

Example 6B 

Examples of strong promoters increasing specific enzyme activity of 
aspartokinase when expressed in C glutamicum are shown below. 



Influence of IPTG on Aspartokinase activity 



Strain 


Regolatoi^pcomot^-geae 


Indacer 


fimnlAntn/mg 


BFIOO 


none 


none 


110 


?D9trc-ask 


ladltrc-ask 


none 


103 


VWtrc-ask 


lacOtrc-ask 


+IPTG(30mg/L) 


269 


131-2 


lacl/trc-ask 


none 


59 


131-2 


ladltrc-ask 


-HPTG (30 mg«-) 


117 


131-5 


lacl/trc-ask 


none 


59 


131-5 


ladltrc-ask 


+IPTG(30mgflL) 


123 



pD9 is a pbsooid that replicates in C gfiitamicum. 



13 1 strains have the trc-ask constnict integrated into the genome. 
IPTG induces genes controfled by the TRC promoter. 
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Example 6C 

Examples of the influence of lacl/trcnisk on lysine production in shake 
flasks are shown below. 



StnuD: 


Indaction 


O.D. 


Titre 


Yield 


S.P. 


BFIOO 


none 


46 


26 


43 


58 


¥D9trc-ask 


none 


49 


30 


49 


61 


?D9trc-ask 


+IPTG 


45 


30 


50 


68 


BFIOO 


none 


43 


23 


39 


53 


131-2 


none 


34 


27 


46 


82 


131-5 


none 


35 


28 


47 


82 


O.D. - optical density at 66Qiim 


Titre - grams Lysme/liter 


Yield = grams lysine made/gn 


ims dextrose consiuned 


S.P. = grams tysme/O.D. 



The production of lysine by BFIOO was improved by increasing the 
strength of the aspartokinase promoter. 

Example 7 

This example demonstrates the use of vector pDEd2i2-ask'asd'dapA- 
ORF2'dapB-ddh-PlfysA (pDElia2KDABHPlL) in the construction of the high 
yield cell lines of the invention. The Hpal-Pvall fragment containing the PI 
promoter was prepared as described in Marcel T., et al. Molecular Microbiology 
4:1819-1830 (1990), Applicants utilized standard PGR and subcloning 
procedures as set forth above. For cell transformation experiments with the 
isolated nucleic acid molecules of the invention, the growth preparation of 
competent cells, and determining or relative growfli may be done according to 
the procedure set forth above. 
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Applicants performed Ae following steps in constructing flie following 
vectors used in the L-lysine biosynthetic pathway. 

1. pGEMT-ask-asd: ~2.6 Kb PGR product containing the ask-asd 
opaan of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems). 

2. pUC\S-ddh: -1.3 fragment of pADM21 containing ddh 
^FlOO locus) was subcloned into pUClS at the Kpnl site. 

3. pFC3-ask-asd: -2.6 Kb Nsil-Apal fra^ent of pGEMT-ask-asd 

was cloned into pFC3 cut witii Psfl and Apal. 

4. pFC3-<i975-OiiF2-<i^: -2.9 KbPCRproductofNRRL-Bl 1474 

dapB-ORF2-dapA coding r^on was cloned into pFC3 at the EcoRV site. 

5. pFCl-ddh: -1.3 Kb Pstt-EcoRI fragment of pUC18-rfd/» was 
cloned into pFCl cut with PstI and EcoRI. 

6. pUC19-Pl: -550 bp Hpal-PvuH fi:agmait (containing flie first 
promoter, PI, of the argS-lysA operon) of pRS6 was cloned into pUC19 at the 
Smal site. 

7. pUC19-Pl/>!S^:-1.45KbproinoteriessPCRproduct,usingprimer 
LysA{ATG) and LysA^B, of NRRL-B11474 fysA coding region is cloned into 

pUC19-Pl at die HincH site. 

8. pFCl-f llysA: ~2 Kb EcoRI-HindHI fragment of pUCl?-Pl/:)»i4 

was cloned into pFCl cut with EcoRI and HindlH. 

9. pFCl-VllysA-ddh: ~1 .3 Kb EcoRI-NotI fragment of pFCl-ddh 
was cloned into pFCl-Pl/>Si4 cut with EcoRI andNotI, 

10. pFCl-ask-asd-<idh-P llysA: -2.6 Kb Swal-Fsel fragment of pFC3- 
ask-asd-vtas clcmed into pFC\-ddh-fllysA cut with Swal and Fsel. 

11. pFC3-ask-asd-dapB-ORF2-dapA-ddh-PllysA (pFC3- 
KDABHPIL): -5.9 Kb Spel fragment ofpFCl-ask-asd-ddh-PllysA was cloned 

into pFC3-dapB-ORF2-dapA at the Spel site. 

12. pDBlia2'ask-asd-dapB-ORF2-dapA-^h-PUysA (pDElia2- 
KDABHPlL):~8.8KbPmeIfragmentofpFC3-flsfc^d-^a/?£-Oi?F2-Jfl;z^-rfi/A- 

¥llysA was cloned into pDElia2 at the HmcII site. 
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Primeis used in PGR: 

/>^y.4(ATG): CCGGAGAAGATGTAACAATGGCTAC 
l>!Si43B: CCTCGACTGCAGACCCCTAGACACC 

The nucleotide sequence (SEQ ID NO: 17) of the Hpal-PvuII fragment 
S containing the promoter PI is shown in figure 20. Results of lysine production 

in hfRRL-B11474 conqmsing the ^I)B\\^-ask-<ad'-dapA'^ 
PlfysA (pDElia2 KDABHPIL) ccmstruct are shown below. 

Strain tested lysine lysine yield (%) cell deposit 

titer 

NRRL-B11474 30 35 

10 NRRL-B11474:5)DEIia2.KDABHPlL 37 42.8 NRRLB30359 

Example 8 

This example demonstrates the use of vector pD^al^cs-^k-asd-dapB" 
ddh'lysA (pDElia2fcsKDBHL) in the construction of the high yield cell lines of 
the myention. The pDBlia2K:s^BHL vector comprises a truncated ORF2 gene 

IS and lacks a dapA gene. The ORF2 gene was cleaved at an internal Clal site, 

removing the 3' region and the dapA gene. A promoterless lysA gene was 
obtained from NRRL-B 11474. For cell transformation experiments with the 
isolated nucleic acid molecules of the invention, the growth prq)aration of 
competent cells, and determining of relative growth may be done according to 

20 the procedure set forth above. Applicants performed tiie following st^s in 

constmcttng the following vectors used in flie L-lysine biosyntfaetic pathway. 

1. pGEMT-osArOs^/: -2.6 Kb PGR product containing the ask-asd 
operon of ATCC21529 using primers ask and asd was cloned into pGEM-T 
(Promega pGEM-T vector systems). . 

25 2, pVCi-ask-asd: -2.6 Kb NsU-Apal fragment of pGEMT-a^A:-^^^ 

was cloned into pFC3 cut with PstI and Apal. 
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3. pFC3-<iapB-ORF2-dapA:~2SKbPCKproductofNRSa^ 
dapB-OSF2-dapA coding region was cloned into pFC3 at the EcoRV site. 

4. pFC3-dapB:^helaIgeClalf[ag^le^AofpFC3-di^B-OIU^^ 
was religated. 

5. pUC 1 &~ddh: ~1 .3 Kb Kpal fiagmrait of pADM2 1 containing ddh 
(NRRL-Bl 1474 locus) was subcloned into pUC18 at the Kpnl site. 

6. pFCl-ddh: -1.3 Kb Sall-EcoRI fiagment ofpUCli-ddh was 
cloned into pFC 1 cut with Sail and EcoRL 

7. pFCl-<idh-fysA: ~2.1 Kb EcoRI-PstI fragment (contaming the 
intact fysA DNA) of pRS6 was clone into pFCl-ddh cut with EcoRI and PstL 

8. pVCl-askrasd-ddh-lysA: -2.6 Kb Swal-Fsel fiagmaat of pFC3- 
ask-asd was cloned into ipFCl-ddh-ly^A cut with Swal and FseL 

9. pFCS-askHisd-dapB-^dh-fysA: -6 Kb Spelftagmentof pFCl^A:- 
asd-ddh'lysA was cloned into pBCi-dapB at the Spel site. 

10. pim\id2^-ask-asd-dapB-ddh4ysA (pDElia2Fcs-KDBHL): -73 
Kb Notl-Pmel fragment of pFCS-ask-asd-dapB-ddh-lj^A was cloned into 
pDElialfcs cut with NotI and Pmel. 

1 1 . pDElia2K:5: thesmall PvuIIfragtnent of pFC5 was ligated with the 
large PvuII fragment of pDElia2. 

Results of lysine production in NRRL-Bl 1474 comprising &e 
pDEHsil^'^k-asd-dqpB-ddh'lysA (pDElia2jc$KDBHL) are shown below. 
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Strain tested lysine titer lysine yield (%) cell deposit 

NRRL-B11474 31 49 

NRRL-Bl 1474:i)DElia2pcs-KDBHL 3X8 58 NRRL B30360 

♦ ♦ ♦ ♦ ♦ 

Having now fully described the present invention in some detail by way 
of illustration and example for purposes of clarity of und^standing, it will be 
obvious to one of ordinary skill in the art that same can be performed by 
modifying or changing the invention with a wide and equivalent range of 
conditions, formulations and other parameters thereof, and that such 
modifications or changes arc intended to be encompassed witiiiin the scope of the 
appended claims. 

All publications, patents and patent applications menticmed in this 
specification are indicative of the level of skill of those skilled in the artto whidi 
this tnvoition pertains, and are herein incorporated by reference to the same 
extent as if each individual publication, patent or patent application was 
specifically and individually indicated to be incorporated by reference. 
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AppUcanfs or ageofs file 

refcMce ouiriber: 1533.103PO.- 



intematioiial 9pp\icad^t& 

TBA 



INDICATIONS RHATING TO A DEPOSITED MICROORGANISM 

(PCI Rule ISfcis) 




indications 
R. IDENTIFICATION OF DEPOSIT 

Name of deposftoiy mstitndon 

Agricultural Research Culture Collection (NRRL) 

Anldmg nf depoaitmy institiitmn (bidudiiif postal code and courUrj) 

1815 N. Univttsity Street 
Peoria, XUinois 61604 
United States of America 



Fbitfaer deposits me identified on an Additio&al sheet O 



Dateof deposit 16 Septemb^ 1999 

(16.09.99) 



Accession Number 

NRRLB-30218 



C ADDITIONAL INDICATIONS (leave Nank fftua qpplhable) 

A::pK184-KDABDT. 



Ibbinfonnatian is Gontmued on an additional sheet Q 



D. DESIGNATED STATES TOR V¥fflCH INDICATIONS ARE MADE (^Ote adUuUUmM an not for aUdesignaied States) 



E. SEPARATE FURNISHING OF INDICATIONS Otaf€ ttank (foot appUcabie) 



The indications listed Wow wiU be submitted to tiie interoatioaal Bureau later (specify the generoi name of the indkaiums, & g., 
"Accession Niatdter of Deposit") 





• 


^ Hiis sheet was received with the intematioiial qipLication 


□ Hiis sheet was received by the International Bureau on: 


Authonzed officer a 


Authorized officer 



fkm PCr/RQ/1 34 (loJy 1992) 
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Applicant!! or ageotfs file 


Inteiiiatjcinal applicatiojP^C^ 


nftiBoce nninber lS33.103PCv«/ 


TBA 



INDICATIONS RELATING TO A DEPOSFTED MICROORGANISM 

(PCX Rule 13bis) 



A. The indications made below relate to the nucroorganism refeored to in the desciiption on page line .3_, 



B. IDENTIFICATION OF DEPOSIT 



Fiiiflker deposits are identified on an additional sheet B 



Name of depositoiy insdlution 



Agricultuial Research Cultuie Collection (NRRL) 



Addtess of depository instiiutioQ f/teiiid£Kf fof^ 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Daieof deposit 16 Septemb^ 1999 

(16.09.99) 



Accession Nuniber 

NRRLB-30219 



C ADDITIONAL INDICATIONS (Um l^ank ffnot tq^flkabk) 



This infi»Diation is continued on an additional sheet Q 



B::pK184-KDAB 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE Oftheiadicatiataans notJbraU dcsignaUdStaUj) 



E. SEPARATE FURNISHING OF INDICATIONS OmbM^noeappUaibie) 



The indications listed below will be submitced lo the Inlrmational Bureau later (specify the general nature of the indicatiQiu, e.g., 
''Accession Number of Deposit") 







^^^Ihis sheet was received with the international applicadon 


□ Ihis sheet was received by the International Bureau on: 


Autfaotized officer ^ 


Authorized officer 



Focm P<nVROA34 (July 1992) l03dcpB302l9.jp 
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Apidicant's or agoif s file 

1533.103PC0. 



loteniadoaal applicaticfi^G 

TBA 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13to) 



The indications made below relate to the micaroorganism refcned to in ihe desciqXioii on page ,8^ line jl^. 




B. IDENTIFICATION OF DEPOSIT 



Further deposits are identified on an additional sheet ^ 



Name of depository institutioo 

Agricultural Research Culture Collection Q<RRL) 

Address of depooUxy histitiitioa (induding postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 16 Septemb^ 1999 

(16.09.99) 



Accession Number 

NRRLB-3Q220 



C ADDITIONAL INDICATIONS (Uave blank if not applicabit) 

CripKlS^-KDABDl. 



This infiDrmation is continued on an additional sheet ^ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ((fife indjcations are not far oU desiptated States) 



E. SEPARATE FURNISHING OF INDICATIONS {ka^bM^nctappUaaM) 



The indications listed below wiU be submitted to the international Bureau later (^c(fr the general nature of the indicatuMS, e.g.. 
"Accession Number of Deposit") 







This sheet was received with the mtemational application 


□ This ^eet was received by the International Bureau on: 


Authorized officer 


Authorized officer 



B>rmFCiyR0^134(Jaty 1992) 
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Aiq;)!icant^5 or agent's file 

lefiecence numbert 1533.103Pa/. 



International appHcatic)?^- 

TBA 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(per Role 13friy) 



A, The indications made below relate to the microorganism refened to in t he description on page ^8^ line 

Foitfaer deposits are identified on an additional sheet ^ 



B. IDENTIFICATION OF DEPOSIT 
Name of depoutoiy institution 

Agricultural Research C ulture Collection (NRRL) 

Address of deposittoy institution (including postal code and country) 

1815 N. University Street 
Peoria, Illinois 61604 
United States of America 



Dateofdq)Osit 16 September 1999 

(16.09.99) 



Accession Number 

NRRLB-30221 



C. ADDmONAL INDICATIONS (teavtf^^fln* if fwrflpf/fcfll^ie; 



This infomuuion is continued on an additional sheet Q 



D::pK184-KDAB 



D. DESIGNATED STATES FOR WfflCH INDICATIONS ARE MADE (tfthe vulkatioMarenatfia^aUdesiffia^Sutus) 



R SEPARATE FURNISH ING OF INDICATIONS 

The indications listed below wU be submitted to ±e inl«natioDal Bureau later (spe^^ Ihe genenU nature of the indications, eg,, 
''Accession Number of Deposit'') 







j^l^ltds sheet was received witti the intematioaal appUcation 


□ This sheet was received by the International Bureau on: 


Authorial officer ^ 


Authorized officer 



Ibcm FCT/RQf 134 (July 1992) 
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Applicant's or aseats file 

refieseooe number: 1533.103PCt^ 



ol applicadBi 



TBA 



INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulelSfetf) 



A, Thf iiHtn^»«^"g made below relate to the microor ganism referred to in the description on page line _5_. 

B. IDENTIFICATION OF DEPOSIT Further dqiositsaieidaitificd on an additional sfafset Bl 

Name of depositoiy institution 

Agricultural Research Culture Collection ff^RRL) 



Addcess of depository instimtion (bidudbig postal axU and country) 

1815 N. Univesrsify Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 16 September 1999 

(16.09.99) 



Accession Number 
NRRL B-30222 



C ADDITIONAL INDICAHONS (leave Monk if not appUoMe) 

E::2(KDABD'L) 



nils 



ion IS I 



ued on an additional sheet Q 



D* DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE {if the indications are not farm desiffigUdSiata) 



E. SEPARATE FURNISHING OF INDICATIONS (Ua^ebUnk^MtapptioAU) 



The indications listed below will be submitted to the intematioiial Bureau later (specify the general nature of the indications, e.g., 
"Accession Number of Deposit") 







^ This sheet was received with the mteioatioDal application 

^Es ^ 


□ This sheet was received by the Intematianal Buieauon: 


Auttionzed officer a 


Authorized officer 



Ftann PCr/R0^134 (July 1992) 
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Applicant's or agait*s file 

refoence number: 1533.103PC03 


frtftrimrinnal applicatiR fto* 

TBA 


INDICATIONS RELATING TO A DEPOSII ED MICROORGANISM 

(PCr Rule ISbis) 


1 A. TTie indications made below i^atB to the miaiKyganism 1 


B. IDENTIFICATION OF DEPOSIT 


Fmthet defiosits aie identified on an ad<fitioiiaI sheet B 


Name of depository institution 




Agricultund Research Culture Collection ^JRRL) 




Address of deposltaty institution (induding postal axU and country) 

m 


1815 N. Univesrsity Street 
Peoria, niinois 61604 
United States d Amoica 




Datcofdepont 29 S^tembw 1999 

(29.09.99) 


Acces^on Number 

NRRLB-30228 



C ADDITIONAL INDICATIONS (leave bUmk if not applicable) 



This infoitnation is continued on an additional sheet ^ 



DH5-a MCR pK184-KDABDI- 



D, DESIGNATED STATES FOR WfflCH INDICATIONS ARE MADE (tfthe indicationj are not for aU desigmaed States) 



E. SEPARATE FURNISHING OF INDICATIONS (Uavt Nmk tfna appUcabU) 

The indications listed below wiU be submitted to the intematiooal Bureau laU^ (spe^ the gaiend namre of the indications. e.g.. 
^Accession Number of Deposit") 







^Tbis sheet was received widi die intemationai a^ifdication 


□ This sheet was received by die International Bureau on: 


Authori»d officer 

1 yi^M.jfA^. f ait^ 


Authorized o£&cer 



Farm PCT/ROn34 (July 1992) 
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Applicanifs or agent* s file 

icfinaencc number: 1533.103PC0j 


International ai^ucatiwii W. 

TBA 


INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule I3bis) 


A. Tlie indicadoiis below xelate to the micnxirganism xefeoed to in the description on page „43^ line _2 


B. IDENTIFICATION OF DEPOSIT 


Partiierdqwsittaie identified on an adiBHonal sheet " 


Name of depository institution 




Agricultural Research Culture Collectioii (NRRL) 




Addiess of depositoiy institution (including postal code and country) 


1815 N. Univcasity Street 

Peoria, Dlinois 61604 

United States of America 




Date of deposit 16 December 1999 

(16.12.99) 


Accession Number 

NRRLB-30233 


C ADDrnONAL INDICATIONS (Iflav«Wkif^ Tliisinfiiiinationisa>ntiniiedonanadjfitionalsheet □ 



DHS-a MCR pDelia2-KD ABdL 



D. DESIGNATED STATES FOR WfflCH INDICATIONS ARE MADE ((ritofridtogto»flrgfu»rj^ 



E. SEPARATE FURNISHING OF INDICATIONS o^bM^^nctappUcabU} 

•ne indications Usted below wiU be submitted to the international Bureau later (spec^ the general nature of the indications, e.g„ 
"Accession Number of Deposit") 







^Ibis sheet was received with the inteniatioiial j^plicatitm 


□ This sheet was received by the International Bureau on: 


Authorized officer 


Authorized officer 



Rwn PCT/R(yi34 (July 1992) 
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Ai^canf s or ageaf s file 

r^rcnoc munberZ 1533.l03Pcui 



Intcmatioiial applicatiSS 

TBA 



nWICATIONS RELATING TO A Dm>SITEa) MICROORGANISM 

(PCX Rule 13few) 



A, The indications made below relate to the micnxggamsm lefened to in the description on page line _S_. 



B. IDENTIFICATION OF DEPOSIT 



Further deposits ue identified on an additional sheet ^ 



ition 



Name of depository 

Agricultural Research Culture CoUectioii (NRRI^ 



Addsess of depo^toiy i 



ion (indtkUng postal code and country) 



1815 N. University Street 
Peoria, nUnois 61604 
United States of America 

Date of deposit 16 December 1999 

(16.12.99) 



Acoessiott Kumber 

NRRLB-30234 



C ADDITIONAL INDICATIONS (Umw bUmk if not applicabU) 

: : pDeliA2.KDABdL 



Thi^ information is continued on an additional sheet ^ 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (if the indlcatkw are not for aU desisnated States) 



E. SEPARATE FURNISHING OF INDICATIONS (leave bbak if not (wpUcet»U) 



The indications Hsted below will be submitted to die international Bureau later (spec^ the general nature oftheindkaUons, e.g,, 
"Accession Number of Deposit") 







^^bis sheet was received widi the international application 


□ This sheet was received by the International Buieau on: 


Authonzed officer 


Authorized officer 



Fonn FCr/RO^m (July 1992) 
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Applicant's or agent'j file 

idiBteiioe mimberl 1533.103PCC^ 


Internatkmal applicati<S?^ 

TBA 


INDICATIONS RELATING TO A DEPOSFTED MICROORGANISM 

(PCX Rule I3to) 


A. The indications made below relate to the tmooorganism lefened to in the description on page line „6_. 


B. IDENTmCATION OF DEPOSIT 


Fuithd- deposits are identified on an additional sheet ^ 


Name of depositsKy instilution 




Agricultuial Research Culture Collection (NRRL) 





Address of dqiositoiy i 



(including postal code and country) 



1815 N, University Street 
Peoria, Illinois 61604 
United States of America 



Date of deposit 16 December 1999 

(16.12,99) 


Aooession Nuiid)er 

NRRLB-3023S 


C ADDmONAL INDICATIONS (/eovielriaiiilr^ROf^V'P Ttusinfiinnation is oontiraied on an additional sheet □ 



: : pDelia2-KDABdl 



D- DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE fcT/fttf wdSKSzir^anf 



E* SEPARATE FURNISHING OF Il^ICATIONSrw/^«r«ii<w^^ 



Ibe indications listed below will be submitted to the international Buxeau later (spec^ the general nature of the Indications, cg^ 
"Accession Number of Deposit") 



For lecdving OCBce use only 



This sheet was received with the intonational application 



I 



Authorized officer 



For bitemational Bureau use only 



n This sheet was received by the biternational Bureau on: 



Authorized officer 



R)nnPCT/RO<134(JaIy 1992) 
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Apidicanf s or agcaf s file 

lefermoe numberl 1S33.103PCX^^ 


Inlcmatioiial appncatunrTic^ 

TBA 


INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule X3bi5) 


A. The indications niade below relate to tibc mioooisaiiism lefeired to in the desaiption on page _8_, line _9_. 


B. IDENTIFICATION OF DEPOSIT 


FmtiiCT deposits are identified on an additional sheet IS 


Name of depositoiy institution 




Agricultural Research Culture Collection ONRRL) 




Address of depository instibition (including postal code andcountry) 


1815 N. University Street 
Pe<»ia, niinois 61604 
United States of America 




Date of deposit 16 December 1999 

(16.12.99) 


Accession Number 

NRRLB-30236 



This infonnation is continued on an additional sheet O 



: ipKlS^-KDABdl. 



D, DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE V^tixe indications are not for aU d€sisnated States) 



E. SEPARATE FURNISHING OF INDICATIONS fleoi* Won* ir#irt^^ 



The indications Ustcd below wiU be submitted to die international Bureau later (specify the general nature oftfie mdicatUms, e,g,, 
"Accession Number of Deposit") 







-J— 

iCr This sheet was received widi the international application 


□ This sheet was received by the Intemational Bureau otu 


Authofized offico^ 


Authorized office 



Boon JPCr/RO/134 (July 1992) 







I/IJoUU/«95d17 




66.11 




Applicanf s or agcaf s file 


mtEmatiooai apfuicatiia m. 




reference number: IS33.103PCUw 


TBA 





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCTRulclSfeis) 



A. Tlie indicatioiis made below xeiate to the micioorgaiusm lefened to in the desaiptioii on page _8_, line 

B. IDE34TIFICATION OF DEPOSIT Farther deposits arc identified on an additional sheet 83 
Name of depositoiy institution 

Agricultural Research Culture Collection (NRRL) 

Address of depository institution (including postal code and country) 



1815 N. Univ^ty Stteet 
Peoria, Illinois 61604 
United States of America 



Date of deposit 16 December 1999 

(16.1259) 



Accession Number 

NRRLB-30237 



C ADDlTlONALINDlCATlOt& (leave Nank(f not This iafoimation is contiaoed on an additional sheet Q 

: :pDelia2-KDABdL 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (^Oie indkations m not for dtdgnat^ States) 



E. SEPARATE nJRNISHING OF INDICATIOf^ficrnvMi^ 



The indications hsted below will be submitted to tiie mtcamaHnoal Bureau later (spec^ the general nature cfAe indications, e.g., 
"Accession Number of Deposit") 





^^"""^ For hileiuaLioual Buieau use only — — i 


^^^a& sheet was recdved with the international application 


n This sheet was recetved by the hitematioaal Bureau on: 


Authorized ofiSon 


Audiorized officer 



Fbnn PCT/RO/134 (July 1992) 
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Applicant's or age&f s file 

reference number: lS33.103PCv.y- 


Intanational a{q)licati JSTiC 

TBA 


INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13&M) 


A. The indications made below relate to the miciDocganisn] 


I lefened fo in the description on page line _23_. 


B. IDENTIFICATION OF DEPOSIT 


Fuitfaer deposits are identified on an additional sheet ^ 


Name of depository institution 




Agncultural Research Cultuie Collection (NRRL) 




Address of depositary institution (including postal code and country) 


1815 N. University Street 
Peoiia^ Illinois 61604 
United States of America 




Dateofd^HMit 31 October 2000 

(31.10.00) 


Accession Nmnber 

NRRLB-30359 



C ADDITIONAL INDICATIONS (leave blank if not applicable) infonnation is continued on an additional sheet D 



: :pD£Ua2'KDABHPlL 



D. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE (^diei»dicasians are notfor all designaSed States) 



£. SEPARATE FURNISHING OF INDICATIONS (lemv btankffnotappUaible) 



The indications listed below will be submitted to the IntematioDal Bureau later (spec^ the general nature of the indicaUoni, e.g^ 
"Accession Number ofDeposW) 



For recdving OfBoe use only 



sheet was received with the international application 



Authorized officer 




For bitemational Bureau use only 



□ This sheet was received by the intematioDai Bureau on: 



Authorized officer 



Fofin PCT/RCV134 (My 1992) 
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• 


Applicant's or agcntf a file 




kitctnatioiial appHcati^flE- 




referaice numberl 1S33.103PCC./ 




TBA 





INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCX Rule 13bis) 



A, The indications made bdow relate to the miaDO tganisni irfened to in ttie description on page JL, line JL.. 

B. IDEmiFICATION OFDEPOSIT deposits are identified on an additional sheet □ 
Name of depository institution 

Agdcultural Research Cul tuie Collection (NRRL) 

Address of depoaitoiy institutian (mdudifi^g 

1815 N- University Street 
Peoria, Illinois 61604 
United States of America 



DatBofdeposit 31 October 2000 

(31.10.00) 


Accession Number 

NRRLB-30360 


C. ADDmONAX. INDICATIONS (leave blank ffnot appUcabU) infonnation is continued on an additional sheet □ 



: :pDE[ia2FC5-KDBHL 



D. DESIGNATED STATES FOR WmCH INDICATIONS ARE MADE r//ft« ifid^^ 



E. SEPARATE FURNISHING OF INDICATIONS (Um bUmk if not uppUaMe) 



The indications listed below will be submlBed to the intemadonal Bureau later (spec^ the general nature of the indioatUms, e,g^ 
''Accession Number of Deposit") 







^Tbis sheet was received with the intemacional application 


□ This sheet was received by the Inteniationai Bureau on: 


AuthoBzed officer 


Authorized officer 



Fam FCr/R0^134 (July 1992) 
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Whatls Claimed Is: 

1. An isolated polypqitideconqinsing the amino acid sequoice of SEQ ED 
NO:2. 

2. An isolated polynucleotide molecule comprisiag a nucleotide sequence 
encodmg the polypeptide sequence of claim 1 . 

3. The isolated polynucleotide molecule of claim 2 comprising a nucleic 
add having the sequence of SEQ ID NO: 1 . 

4. A vector comprising the isolated polynucleotide molecule of claim 2. 

5. A host cell comprising the vector of claim 4. 

6. A method conq>rising: 

(a) transforming a Corynebdcterium species host cell with the 
polynucleotide molecule of claim 2, wherein said isolated polynucleotide 
molecule is mtegrated into said host cell's chromosome thereby increasing the 
total number of said amino add biosyntfaetic pathway genes in said host cell 
chromosome, and 

(b) selecting a transformed host cell. 

7. The method of claim 6 further comprising screening for increased amino 
add production. 

8. The method of claim 6 wherein said polynucleotide molecule further 
comprises at least one of the following: 

(a) a nucldc add molecule encoding a Corynebacteriwn species 
lysine pathway as6? amino add sequence; 
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(b) a nucldc acid molecule encoding a Corynebacterium species 
lysine pathway dapA amino acid sequence; 

(c) a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway dapB amino acid sequence; 

(<Q a nucleic acid molecule encoding a Corynebacterium species 
lysine pathway ddh amino acid sequmcQ and 

(e) a nucleic acid molecule ^coding a Corynebacterium species 
lysine pathway 7^^^^^ amino acid sequence 

(Q a nucleic add molecule encoding a Corynebacterium specie 
lysine pathway lysA amino add sequence; and 

(g) a nucldc add molecule oacoding a Corynebacterium species 
lysine pathway ORF2 amino add sequence. 

9. The method of daim 8 further comprising screening for inoreased amino 
add production. 

10. The method of claim 6, wherein said isolated polynucleotide molecule 
further con^irtses at least one of the following: 

(a) a nucldc add molecule encoding the asd amino add sequence of 
SEQIDNO:4; 

(b) a nucldc add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(c) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8; 

(d) a nucleic acid molecule oicoding the aimno add sequence 
ofSEQIDNOrlO; 

(e) a nudeic add molecule oicoding tiie 'I^ksA amino add sequence 
ofSEQIDNO:21; 

(f) a nucleic add molecule encoding the lysA amino add sequence 
ofSEQIDNO:14; 
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(g) a nucleic acid molecule encoding the ORF2 amino acid sequence 
ofSEQIDNO:16. 

r 

1 1 . The mediod of claim 6, wherein said isolated polynucleotide molecule 
fiutha' conoprises flie following: 

(a) a nucleic add molecule encoding die asd amino add sequence of 
SEQIDNO:4; 

(b) a nucleic add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(c) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8;and 

(d) a nucleic acid molecule encoding the 0RF2 amino acid sequence 
ofSEQIDNO:16. 

12. The method of claim 6, wherein said isolated polynucleotide molecule 
furtho- conqnises the following: 

(a) aniicleicacidiiioleculeencoding1]iear</aminoacidsequenceof 
SEQIDN0:4; 

(b) a nucleic .add molecule encoding the dapA amino acid sequmce 
ofSEQIDNO:6; 

(c) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8; 

(d) a nucldc add molecule encoding the ddh amino add sequence 
of SEQ ID NO: 10; and 

(e) a nucldc add molecule encoding the ORF2 amino add sequence 
ofSEQIDNO:16. 

13. The method of claim 6, wherein said isolated polynucleotide molecule 
further conqirises the following: 

(a) a nucldc acid molecule encoding the asd amino add sequence of 
SEQIDNO:4; 
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(b) a nucldc add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(c) a nucldc add molecule mcoding flie dapB amino add sequence 
ofSEQIDN0:8; 

(<Q a nucldc add molecule encoding Ifae^i/A amino acid sequence 
ofSEQIDNO:10; 

(e) a nucldc add molecule ^coding the amino add sequence 
ofSEQIDNO:21;and 

(f) a nucldc add molecule encoding the ORF2 amino add sequence 
ofSEQIDNO:16. 

14. The meAod of claim 6, wherein said isolated polynucleotide molecule 
further comprises the following: 

(a) a nucldc add molecule encoding the asd amino add sequmce of 
SBQIDNO:4; 

(b) a nucldc add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(c) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8; 

((Q a nucldc add molecule encoding the i/c/A amino add sequence 
ofSEQIDNO:ia, 

(e) a nucleic add molecule mcoding the lysA amino add sequence 
of SEQ ID NO: 14; and 

(f) a nucldc add molecide encoding the 0RF2 amino add sequence 
ofSEQIDNO;16. 

1 5. The method of claim 6 further comprising: 

(a) growing said transfomied host cell in a medium; and 

(b) purifying an amino add produced by said transformed host cell 
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16. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; and 

(b) at least one additional Corynebacterium spedes lysine pathway 
gene selected from the group consisting of: 

5 (i) a nucleic add molecule encoding an asd polypeptide; 

(ii) a nucleic add molecule encoding a dapA polypeptide; 

(iii) a nucldc add molecule encoding a dapB polypeptide 

(iv) a nucldc add molecule encoding a ddh polypq)tide; 

(v) a nucldc add molecule ^coding a 'lysA polypeptide; 
10 (vi) a nucldc add molecule encoding a lysA polypeptide; and 

(vii) a nucldc add molecule encoding an ORF2 polypeptide. 

17. Hie isolated nucldc acid molecule of claim 16, whodn: 

(a) said asd polypeptide is SEQ ID NO:4; 

(b) said dapA polypeptide is SEQ ID NO:6; 
IS (c) said^opB polypeptide is SEQ ID NO:8; 

(d) said<2efA polypeptide is SEQ ID N0:10; 

(e) said 'lysA polypeptide is SEQ ID N0:21; 

(f) said polypeptide is SEQ ID NO:14; and 

(g) said Oi?F2 polypeptide is SEQ ID NO: 16. 

20 18. An isolated polynucleotide molecule conprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic acid molecule oicoding tiie asd amino acid sequence of 
SEQIDNO;4, 

(c) a nucleic add molecule encoding the dapA amino add sequoice 
25 ofSEQIDNO:6; 

(d) a nucldc add molecule encoding the dapB amino acid sequence 
of SEQ ID N0:8-, and 

(e) anucldc add molecule encoding the ORF2 amino add sequence 
of SEQ ID NO: 16. 
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19. An isolated polynucleotide molecule compising: 

(a) the polynucleotide molecule of claim 2; 

(b) anucldc acid molecule encoding the asd amino addsequence of 
SEQIDNO:4; 

(c) a nucldc add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(d) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8; 

(e) a nucldc add molecule encoding the ddh amino add sequence 

of SEQ ID NO: 10; and 

» 

(f) a nucldc add molecule encoding the ORF2 amino add sequence 

ofSEQIDNO:16. 

20. An isolated polynucleotide molecule comprising: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic add molecule ^coding the asd amino add sequence of 
SEQIDN0:4, 

(c) a nucldc add molecule encoding the dapA amino add sequence 
ofSEQIDNO:6; 

(d) a nucldc add molecule encoding the dapB amino add sequence 
ofSEQIDNO:8; 

(e) a nucldc add molecule encoding flie ddh amino add sequence of 
SEQ ID NO: 10; 

(Q a nucldc add molecule encoding tfie 'lysA amino add sequence 
ofSEQIDNO:21;aad 

(g) a nucldc add molecule encoding the ORF2 amino add sequence 
of SEQ ED NO: 16. 
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21 . An isdated polynucleotide molecule conpismg: 

(a) the polynucleotide molecule of claim 2; 

(b) a nucleic add molecule encoding the asd amino add sequence of 
SEQIDNO:4; 

(c) a nucldc add molecule encoding the dapA amino add sequmce 
ofSEQIDNO:6; 

(d) a nucldc add molecule encoding the dlop^ amino add sequence 
ofSEQIDN0:8; 

(e) a nucldc add molecule oicoding the ddh amino add sequence of 
SEQIDNOilO; 

(f) a nucleic add molecule encoding the lysA amino add sequence 
ofSEQIDNO:14;and 

(g) a nucldc add molecule encoding the ORF2 amino add sequence 

ofSEQIDNO:16. 

22. The isolated polynucleotide molecule of claim 18 comprising pK184- 
KDAB. 



23. The isolated polynucleotide molecule of claim 20 comprising pDll- 
KDABH'L. 



24. The isolated polynucleotide molecule of claim 21 comprising pD2- 
KDABHL. 



25. A vector comprising the polynucleotide molecule of claim 16. 

26. A host cell comprising the vector of claim 25. 

27. The host cell of claim 26 wherdn said host cell is a Brevibacterium 
selected from the group consisting of Brevibacterium flavum NRRL-B30218, 
Brevibacterium flavum NRRL-B30219, Brevibacterium lactofermentum 
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miSl^B3(mO,Bre\nbacter^ BrevAaderium 
lactofermentum NRRL-B30222, Brevibactenum flavum NRRL-30234 and 
Brevibacterium lactofermentum NRRL-3023S. 

28. The host cell of claim 26 wherein said host cell is Escherichia coli DHS 
a MCR NRRL-B30228. 

29. The host cell of claim 26 wherein said host cell is a C. glutamicum 
selected from the group consisting of C. glutamicum NRRL-B30236 and C. 
glutamicum NRRL-B30237. 

30. A method of producing lysine coirpising culturing the host cells of 
claim 5 wherein said host cells comprise one or more of: 

(a) increased enzyme activity of one or more l^ine biosynthetic 
pathway en:^mes compared to die genetically unaltered host cell; 

(b) one or more copies of each gene encoding a lysine biosynthetic 
pathway emyme; and, 

(c) alteration of one or more transcription factors regulating 
transcription of one or more genes encoding a lysine biosynthetic pathway 

enzyme, wherein said host cell produces lysine in said culture medium. 

» 

31. The metfiod of claim 30 wherein said increased enzyme activity 
comprises overexpressing one or more genes encoding one or more lysine 
biosynthetic pathway enzymes. 

32. The method of claim 3 1 wherein said one or more genes are operably 
linked direcdy or indirectly to one or more promoter sequences. 

33. The method of claim32 wherein said operably linkedpromoter sequences 
are heterologous, endogenous, or hybrid 
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34. The method of claim 33 whanduisaidpromotersequences areoneormore 
of: a promoter sequence from the 5* end of genes endogenous to C glutamicum, 
a promoter sequence from plasmids that rq)Iicate in .C glutamicum, and, a 
promoter sequence from the genome of phage which infect C gliUamicum. 

■ 

35. The method of claim 33 34 wherein one or more of said promoter 
sequences are modified 

36. The mefliod of claim 35 wherein said modification con^nises truncation 
at die 5* end, truncation at the 3* end, non-terminal insertion of one or more 

* 

nudeotides, non-terminal deletion of one or more nucleotides, addition of one or 
more nucleotides at the 5* end, addition of one or more nucleotides at the 3' end, 
and, combinations thereof. 

37. The method of claim 30 wherein said increased enzyme activity results 
from tibie activity of one or more modified lysine biosynthetic pathway enzymes 
wherein said enqme modification results in a diange in kinetic parameters, 
allosteric regulation, or both, compared to the enzyme lacking said modification. 

38. The method of claim 37 wherein said change in kinetic parameters is a 
change in K„„ or both. 

39. The method of claim 37 wherein said change in allosteric regulation is a 
change in one or more enqrme allosteric regulatory sites. 

40. The method of claim 37 wherein said modification is aresult of a change 
in the nucleotide sequence encoding said enzyme. 

< 

41. The method of claim 40 wherein said change in said nucleotide sequence 
is an addition, insertion, deletion, substitution, or a combination thereof, of one 
or more nucleotides. 
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42. The method of claim 30 wherein said altanatioQ of one or more 
transciiptioii factors comprises one or more mutations in transcription inhibitor 
proteins, one or more mutations in transcription activator proteins, or both, 
whoein said one or more mutatioiis increases transcription of the target 
nucleotide sequence compared to the transcription by said one or more 
transcription factors lacking said alteration. 

43. The method of claim 42 wherein said one or more mutations is a change 
in said nucleotide sequence encoding said transoiption factor. 

44. The method of claim 43 wherein said change in said nucleotide sequence 
is an addition, insertion, deletion, substitution, or a combinaticm thereof, of one 
or nlore nucleotide. 

45. An isolated polypeptide, wherein said polypeptide comprises an amino 
acid sequence having at least 95% sequence identity to the amino add sequoice 
ofSBQIDNO:19. 

46. The pol^)eptide of claim 45, whorein saidpolypq)tide has the amino acid 
seqaoice of SEQ ID NO: 19. 

* 

47. An isolated polynucleotide molecule conqirising a nucleotide sequence 
encoding the polypeptide of claim 45. 

48. The isolated polynucleotide molecule of claim 47 comprising a nucleic 
acid having the sequence of SEQ ID NO: 18. 

49. A vector comprising the polynucleotide molecule of claim 47. 

50. A host cell comprising the vector of claim 49. 

51. The host cell of claim 50 whetein said host cell is NRRL B30360. 
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52. A method coir^nising: 

(a) transformiDg a Corynebacterium species host cell with &e 
polynucleotide molecule of claim 47, and 

(b) selecting a transformed host cell. 

53. An isolated polypeptide wherein said polypeptide conqxises a 
polypeptide having at least 95% sequence identity to the amino add sequence of 
SEQlDNO:2L 

54. The polypeptide of claim 53 having the amino acid sequence of SEQ ID 
NO:21. 

55. An isolated polynucleotide molecule conqirising a nucleotide sequence 
encoding tfie polypeptide of claim 53. 

56. The isolated polynucleotide molecule of claim 55 comprising a nucleic 
acid having the sequence of SEQ ID NO: 20. 

57. A vector comprising the polynucleotide molecule of claim 55. 

58. A host cell comprising the vector of claim 57. 

59. The host cell of claim 58 wherein said host cell is selected fiom the group 
consisting of NRRL B30218, hfRRL B30220 and NRRL B30222. 

60. A method comprising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 55, and 

(b) selecting a transformed host cell. 
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61. The isolated polynucleotide molecule of claim 2 fuitha- comprising a 
promoter sequmce where said promote sequence has at least 95% sequence 
identity to SEQ ID NO:17. 

62. The polynucleotide of claim 61 where said promoter sequence has tfie 
nucleotide sequence of SEQ ID NO: 17. 

63 . The isolated polynucleotide molecule of claim 6 1 wherein said promoter 
is operably directly linked to the LysA gene. 

64. A vector con^rising the isolated polynucleotide of claim 6 1 . 

65. A host cell comprising the vector of claim 64. 

66. The host cell of claim 65 wherein said host cell is NRRL B30359. 

67. A method con[^7rising: 

(a) transforming a Corynebacterium species host cell with the 
polynucleotide molecule of claim 61, and 

(b) selecting a transformed host cell. 
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Kinase 

Aspartate- 
semiatdehyde 
dehydrogenase 
(asd) 

Oihydrodi- 
ptcolinate 
synthase 
(dapA) 

Dihydrodi- 
picofinate 
reductase 
(dapB) 

Tetrahydrodl- 
picoiinate 
succinyfase 
(dapD) 

N-Succinyl-amino- 
Ketopimelate 
transaminase 
(dapC) 

N-Sucdnyl-diamlno- 
ptmelate 

desucctnylase 
(dapE) 

Oiaminopimelate 

epimerase 

(dapF) 



L-AspKaatc 
L^Aspartylphosphate 



L^parfatesemialdehydd*- 

L-MetHionine 
L-2t34Dihydrodlpicormate 



L-Threon!ne 



L4soleucIne 



L-PIperideine-2,6-dicarboxyIate 

/ \ 



N-Suc(^nyK2-amino- 
6ketop!melate 



i 



N-SuccInyl-2,6-L.L- 
diamtnopimelate 



L»L-Diaminoplnielate 



DIaminopimelate 

dehydrc^enase 

(ddh) 



\ 



Diamlnopimelate 

. decarboxylase 
(lysA) 

Pennease 




+- Call Wall 
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Nuclootide 



daquenee of ATCC21S29 ask (SBQ ID M0;1} 



1 

!il 
10 1 
Ibl 
201 
251 
301 
351 
401 
451 
SOI 
551 

qoi 

651 
701 
751 
801 
B51 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 



GTGCOGCTGC TCGTACAGAA ATATGOCGOT TCCTCGCTTG ACACTGCCGA 
ACGCATTAGA AACGTCGCTG AACGGATCGT TGCCACCAA6 AAGGCTGGAA 
ATCATGTCGT GGTTGTCtGC rOCGCMXCQ GAGACAGCAC GOATGAACTT 
CTAGAACTTG CAGCGGCAGT GAATCCCGTT CCGCCAGCTC GTGAAaTGGA 
TATGCTCCTG ACTGCTGGTG AGOGTAtTTC TAACGCTCTC GTCOCCIlTGG 
CTATTGAGTC CCTTGGOGCA GAAGCTCAAT CTTTCACTQG CTCTCAGGCT 
GCTGTOCTCA CCACCGAGGG GCACGGAAAC GCACSGCATtG tTGACGTCAC 
ACCGGGTCGT GTGOGTGAAG CACTCGATGA GGGCAAGAtC tQCkTOSnG 
CTGGTTTTCA GGGTGTTAAT AAAGAAACCC GOGATGTGAC CAOGTTGGGT 
CGTGGTGGTT CTGACACCAC tGCACTtGCG TTGGCAGCT6 CmOMOGC 
TGATGTGTGT GAGATTtACT CGGACGTtCA COGTGTGTAT AOCGCIGACC 
CGCGCA!rcGt TCCTAAT6CA CAGAAGCTGG AAAAGCTCAG CTTOGAAGIUl 
ATGCTGGAAC TTGCTOCTCT TGG.CTCCAAG ATTtTGGTGC TGOQCNStQX 
TGAATACGCT CGTGCATTCA ATGTGCCACT TCGCOTACGC TCGTCTTA3CA 
GTAATGATCC CGGCACTTTC ATTOCXSGCT CtATGGAGGA TATTCCTGT6 
GAAGAAGCAG TCCTTACOGG TGTOGCAAOC GACAACTCOG AAGGCAAAGT 
AACCGTTCTG GGT A T l TOCG ATAAGCCAGG CGA6GCTGO& AAGGTmOC 
6TGC6TTGGC TGATGCAGAA ATCAACATP6 ACATGGTTCT GCAGAACGTC 
TCCTCTGTGG AAGACGGCAC CACOGACATC ACGTTCACCT GCCXTCGCGC 
TGACGGACGC CGTGCGWGG AGATCTTGAA GAAGCTTCAG GTTCAGGGCA 
ACTGGACCAA rGrGCTTTAC GACGACCAGG TCGGCAAAGT CTCCCTCGT6 
GGTGCTGGCA TGAA6TCTCA CCCAGGTGTT ACCGCAGAGT TCAT6GAAGC 
TCTGCGCGAT GTCAACGTGA ACATCGAATT GATTTCCATC TCT6AGATCC 
GCATTTCCGT GCTGATCC6T GAAGATGATC TGGAT6CTGC TGCAOTPGCA 
TTGCATGAGC AGTTCCAGCT GGGCGGCGAA GACGAAGCCG TCGTTTATQC 
AGGCACCGGA CGCTAA 
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fVAe.lVATICKAOMOwvvvc 

X81 1 , ^ 

PPARBHDMLI.IAGSRI8HAI. 

"■ ' ' • • ♦ 300 

VAMAIESLGAEAQSFT 



360 



-^nAlESLGAEAQSFTGSQA 

G VfcTTBRHCHAniVpVTPG a 

4 .. I ^20 

VReA&DEGKICrVAGFQGVM 

' , " • ' — I ■■ .•. 480 

KETRDVTTIGRGGSOTIAVA 

" ~* • ► 540 

LAAAlHAOVCEItSovOGv'y 

.4 < ,^ ^ 

TAOPRivpHAQKLeKLSPSE 
HIELAAVQSKII. viRsvEYJl 

4 > , ^ 

—4 — . + 700 

l AGSMEDlt vprit. 

761 

r> u- n _ _ * 



0 



°KSEAKVTVI.C180KPCE 



■* 840 

«^ A 
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^ ^ 

K V ? R A L A o A ^ I t: i ? v L ' :i i: V 

- « ^ 4 , ^ ^ 

S SVeOGTTOZT FTCPRAOGR 
CGTGCG^ 

961 - , 

1020 

R A M E I L K iCX.QVQGMHTNVX,Y 

m 

~ — - ♦ +— ' '+ 1080 

0OQVCKVSLVG AGKKSHP 6V 

+_ J ^ ^ ^ ^ ^^^^ 

T,ABFMEALRCVM 7KIE1.:S-1 • 

+ ..^ —■ ' I- f + 1200 

SBIRlSV.tlF. E DDLDAAARA 

~ " ' " " — •«— — — --+_^ . — J 1260 

l.HEQPQLGdEOeA V.V Y A S T G 

CGCTAA 
1261 ,266 
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NucleotdLdA 



sequteCA of AT0C2IS29 astl {StQ ID UO:jg) 



1 ATGACCACCa 

SI GCGCAOCTTT 

101 TTGCiPTCCCC 

151 ATCGBGGTAG 

201 cgacgittgcg 

251 cactgtttgc 

301 cgcaAggacg 

3SX caagcattcc 

401 TGGCTGCAAt 

451 AAGCTTCACG 

501 TGTCGAAACC 

551 AGTTCGtCCA 

601 GTTTCCCCAA 

651 TGACGGCACC 

701 GCAAGATTCT 

751 GTGCCGCSTrr 

801 GGCAAtCACC 

951 TCGAGCTTGT 

901 TCXXrrCGTTC 

951 TCTGGTTCTC 

100 1 ACAOcarrcA 



TCGCAGTTGT 
TTGOAAGAGC 
GCGTTCCGCA. 
AAGACATTAC 
TTGTTCTCTG 
TGCTGCAGGC 
ACGAGGTTCC 
CTGGTCftAGG 
GCXIAGTOCTG 
TTTCCTCTTA 
TTGGCAAAGC 
TGAtGGACAG 
TCGCTTACAA 
TTCGAAACCG 
CGGCCTCXXA 
TCACCGGCCA 
GTCOAGCAGG 
CGACGTCCCA 
GACCCATCCG 
GTCGTATCTG 
GATTGCTGAG 



TGGTGCAACC 
GCAATTTCCC 
GGCCGTAAGA 
TCAGCCAACC 
CTGG/VGGCAC 
GOGACTGTTG 
ACTAATCCTC 
GCftTTATTGC 
AAGCCACTOC 
CCAGGCTGTT 
AGGTTGCTGC 
GCT6CTGACG 
CGTGrrGCCA 
ACa=JU;AC-CA 
GACCTCAAGG 
CACGCTGACC 
CGCftGGAGAT 
ACCCCAC7TG 
TCA6GACTCC 
GGGtiTAACCT 
CTGCTGGTTA 



<5°CCAGGT0e GOCAGGTTAT 



TTGAATTOGG 

GAGGAGTOCX: 
CGCTTCCAAG 

TCtGAGQTGA 
GAATCCTAAC 

tOCGGTTCTG 
AGnCTOGAC 
CAGGCOATGT 
TTCGCCGGAA 
5AA6CTGCGC 
"XrrCAGGCAC 
ATTCACGCCG 
CTTGCGTGQC 
CAGCTGCCGG 
ACTGTCGACG 
7CGCAAGGGC 
AGTAA 



TCAAQG6CAT 



AOCCTTCCGA 
TGCAOCAOCA 
TGGTC1CT6TA 
GTCTTGCAGG 
CACAACGTT6 
CGGAOCCTAC 
ACCTCGTCGA' 
AACGAATCCC 
CTGCGTCCGC 
AATTC6ACAA 
GCrrCAGGCG 
CATTGACGA^ 
ACAACCGCGC 
GCAGCACTGA 
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1 • • * ^ — , eo 

NTTIAVVCATC0VCOVHRT F 

iBcanrPAOTvarrAfiFRsj^ 

, — , . iflo 

CUfclCPACTtrcVeOXTOAt 




CESCKCroVALrSACCtA 
QYAeVrAAACATVVDJIssAV 

ccc iuiaao caoo^ccmwact M t cLKiwu ^ 

tKDOCVPLXV StVNPSOKOS 
tVKCXXANPltCTTKAAHPVL 



K>L«pAACt«rLBVSStQAW 

SCSCIJJCWCTUAKQVAAV CO 

KMVBPVROCOAAOACOifC 

^TTTCCCCWLtCGCnMIWWmxr^^ 

. ♦ , ^ ^ ^ ^ ^ 

VSPIAtHVLPrACNLVPU C T 

'* ■ - - I - "4- >20 



710 

OLKVSCtCVfivPV 



F T C a t L T 



... ». — » — . — - — t 

ASCVtl.vq v»TPL»,»,0 I oe 

P.J SA 
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TCCCTCGTTGGAOGCATCCGTCAaSACTCCACTGTCGACCACM 
001 ♦ 4-— « *- ♦ 9GQ - 

SLVGRIRQDSTVOONFlCLV L 

GTCGTATCTGGCGATAACCTTCGCAAGGGCGCAGCACTCI^ 
961 — —"1. -^•.—4— r — ^^•^ — ^Q2Q 

V.V S G O H LRKGA ALH T t Q I A E* 

CtGCTGGTTAAGXAA 
1021 4 1035 

I. Ir V K * 
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MuelootilclQ. saqvenoe or dapA (SBQ lo HO: 5% 

1 ATGAGCACAG GTTTAKAGC TAAGACCGGA GTAfiAGCACT TCGGCACCXTT 

Si TGCAGTACCA ATGGTTACTC CATTCACGGA. ATCOGCAGAC AtCGATATCG 

101 CTGCTGGCCG C6AAGTOGCG OCTTATTTGG TTOVTAAOGG CmGAnCT 

151 TTGCTTCtCG OGGGCACCAC TGGTGMTCC CCAAOQVCMi OQGCOOCIGA 

201 AAAACTA6AA CTGOTCAAGG CCGTTCCTGA GGfU^GTTGGG GAtCGQGOGA 

251 AGCTCAXCGC CGGTGTCGG^ ACCMCAACA CGCGGACAIC tGTQCAACTT 

301 CCGGAAGCTG CTGCTTCTGC ttSGOGCAOAC GGOCTTTTAS TTGTAACTCC 

351 TTATTACTCC AASCXGACCC AAGAGGGATT GCTGGOQCAC TTCGGTGCAK 

401 TTGCTCCAGC AACACAGGTT CCAATTTGTC TCXATGACftT TCCIGGTCGG 

451 tCAGGTAWC CAATTGAAtC TGATXCCATG AGAOttCTGA GTGAATTACC 

501 TACGATTTTG GCGGTCARGG ACGCCAAGGG TGftOCTOGTT CCAGCCACGT 

551 CATTGATCAA AGAAACGTCA CTTGCCTGGT ATTCAGGOGA TCACXCACTA 

601 AACCTTGrrr GGCri G CTTT GGGCGGATCA GGTTTCATrr OCGTAATTGG 

651 ACAT(^:»GCC CCCACAGCAT tACGTGAGTT CTACACRAGC TTCGAGGAAC 

701 GCGACXrrCGT CXXTCCGCGG GAAATCAACG CCAAACTATC AOCGCTGGTA 

151 GCTGCCOVAG GTCGCTT6GG TGGAGTCAGC TTGGCAAAAG CTGCTcTGCG 

801 TCTGCAGGGC ATCAACGTAG gagatcctcg acttccaatt atggctccaa 

«5l atgagcagga acttgacgct ctccgagaag acatgaaaaa agctggagtt 

901 CTATAA 
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SEQ ID:6 Amino acid sequence of dap A 

ATGAGCACAGGTTTAACAGCTAAGACCXI^ 
X ^ +- + +^ + 60 

MS TGIiTAKTGVEHFGTVGVA 

ATGGTTACTCCA^TTCACGGAATCaSGAGAa^TC 
6X + + + + + + 120 

MVTPFTESGDIDIAAGRBV A 

GCTTATTlXXSTTGATAAGGGCnrZX^^ 
121 + + + + + + 180 

AYIiVDKGLDSLVXiAGTTGSS 

CX:AACGAaUU:CGCCGCTGAAAAACTAGAACI^^ 
181 + + + + + + 240 

PTTTAAEKLBLIiKAVRBBVG 

GATCGGGGGAAGCTCATCGCX^GGTGTCXSGAAC^^ 
241 + + + + + + 300 

DRAKLIAGVGTNNTRTSVEL 

GCGGAAGCTGCroCITCTGCTGGCGCAGAaS^ 
301 + + + + + + 360 

AEAAASAGADGLIiVVTPYYS 

AAGCCGAGCa^GAGGGATTGCTGGCGCAOTPCGGT^^ 
361 + + + + + + 420 

KP SQEGLIjAHFGAXAAATEV 

CaUVTTTGTCTCTATGAa^^ 
421 + + + + + + 480 

PICLYDIPGRSGIPIESDTM 

AGAC»CCIX3AGTGAATTACCTACGATTTTG^ 
481 + +- + + + + 540 

RRXiSEIiPTXXiAVKDAKGDLV 

GCAGCCACGTCATTGATOVAAGAAACGGGACTTGCCrC^ 
541 + + + + + + 600 

AATSLIKETGLAWYSGDDPIi 

AACCTTGTTTGGCTTGCTTTGGGCGGATC^^ 
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601 + + + + + + 660 

NLVHLALGGSGPISVXGHAA 

(XCACAGCATTACGTGAGTTGXACACA^ 
sex + + + + + + 720 

PTAIjRBIiYTSFBBGDIjVRAR 

GAAATaU)£:GGCAAACrATCACa^^ 
721 + + + + + + 780 

&XNAKLSPI.VAAQGRL6GVS 

rrGGCAAAAGCTGCTctGa^TCTGC^^ 
781 + -f + + + + 840 

XiAKAAttI^X>QG XKV6DPRIiPI 

ATGGCrrcCSVAATGAGCAGGAACr^^ 
841 + + + + + + 900 

MAPNBQEIiSALRSDHKKAGV 

CTATAA 
901 906 

L * 
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_ «f dapa ISSO TO HO". ^ 

n^rcCACrC PJvrJXOSK TTGGTCWAC 
I ATGGGAATCA IStOCGACG'V tCTGt»GCTT GTTGCRfilVGA 

51 TATIGTGGCA eCACTOATC taGACWVCGG CGCTGAMTt 

.101 TOeeCCtCGA TftACGCtcrG atcgqcracc tgovcttctg 

isi eTO«s««« ^ri^^ 

201 CMCAM3»C GGCM«^ GoCTOTCrtG AABGMUWGA CWUCtCCet 

251 CtOCTTTGGA GCIf^JJ^ T^OCtCT GOKStetTGK OCRIGGTCTT 

301 GMCtGKTCG OJOCTWOT TOCT^-^ 

331 TICCAAGCAG GCTCOO^ gSoCTTCRC GCRCCQGGASf CCBCACTGCT 

401 «»PX»JL O^J^ gJJ^JJ^ 

451 OVGWSCBTTG CiroeGCAa C^««*^ 

501 TCXGACCGRG WGGO^ .poe^^^ 
SSI TCCCiiGtT<» ^R^CGC ^^^^^ 

601 ATCTTT6GCA OO^G^ ^J^^ GGGTGT^ 

651 SSr^ 

701 iWSCACCCAGG CCTiM^TCGtA GGI^crK-w^ 



wo 01/49854 FCT/US00O5617 

ATGCGftATCAASCTTGG C Gl lCT CGG AWXaWACGCXy;TGTTGGTCAAM:TA.TTGTGGOV 
J 4. — . + , ♦ — • * 60 

HGI KVGVUGAKGRVGQTIVA 

GCAGTCAATGAGTCCa«lCGATCTGGAiX:TTCTTGCAGACllTC^ 

♦ ♦ V 4 ♦ •+ 120 

AVNESOOI'ELVASIGVDOOL 

AGCCTTC1^5GTAia«U«CGGra:T<yUUSTTGTCG 
121 + ♦ ♦--^ ^ WO 

SLLVONGABVVVOFTTPMAV 

ATGGGClVW»GGW5«CrGCM^^ 
181 : 1- 1 + ♦ »- ^ 

MGHLEPCXH-HGISAVVGTTG 

ttcgataatcctcgtttogjwcaggttcgcgcc^^ 
pohaki*eqvrawlegkdhvg 



241 



GTTCTGATCGCAa^TAACTTTGCTATCTCTGCGGTGTTGACCATGG^ 
301 — w ^ ^+ —4-^ -+ — — 350 



V L 



APNfAlSAVtTMVFSKQ 



361 4. i ^ ^ * + 

AARFFESACVIBLHHPHKLD 

GCACCTtCAGGCACCGCGATCXACACTGC^ 



A P S G T 



AIHTAQ GIAAARKEA 



GGCATGGACGOVCAGCCAGATGCGACCGAGCAGGOi^GAGGGtTCCCGT^^ 
4BX + ^ ^ * ^ ► 540 

G M O AQ P OAT EQA LEGSR^aXs 

m 

GTAGATGGAATXXCaGTXCAcGCAGTCCGO^GTCCG©^^ 

^*.4 — • — . — " t- 600 

V. DG ipvra'vrmscmvah^qv 

ATCTTTGGCACC(»CGGTCflGAOCXTGAC^^ 
6Q1 1«,««^-*^-4 --—4 ■ ■■*>4- 660 

I FGtQGQTI^TI.KQDSYORN S 

TTTGCACXZAGGTGTCTTGGTGGGrrGTGCCCAA 

FAPGVLVG VRNIAQHPGLVV 

GGACTTGAGCATTACCTAGGCCTGTAA 
721 . + + 747 . 

GLEHYX-Gt*- 
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Huclc^tildo. 




(SeO ZD HOi%} 



t ATOCATTtCO GTAAGCTCGA CXIAGGAC^ GCCACCACAA TTTTCGJ^GGA 

SI TTACAAlBAAC ATGACCAACA TCCGCGTAOC TATCGTaGGC TACGGAiytcc 

101 TG6GACGCAC OGTCGAAMlG CTTAnCCCA ACCAGOCCGA CATGGACCTT 

XSl GtAGCAATCT TCTCGCGCC6 GGCCACOCTC GACACAAAGA GGOCAGtCTT 

20L TGATGTOGOC GACGTGGACA AOCACGQCGA CGACGtGGAC GTGClGtTCC 

251 TGTGCAXGGG CTOCOCCfiCC GACATOCCTG AfiO^OGClyCC AAAGtTOQCG 

301 CAGnCGCCX GCnOOCTAGA CACCTACGAC AACCACCGCG ACATOCCftOG 

3SI CCACX:GCCAiG GTCATGAACG AAGCCGCCAC CGCA60CGGC AAOGTt(K:AC 

401 TGGTCTCTAC CGGCTGGGAT CCACGAATGT tCTCCAtCAA GOGCGTCTAC 

ASX OCAGCGOCAG TCTTAGCC6A GCACCAGCAG CACACCTTCt GGGGCCJCAGG 

301 TTtGTCACAG GQCCACTCOG ATGCTTTGOG ACGCATCCCT GGCGTTCAAA 

SSI AGGCcGTCCA GTACACCCTC CCATCCGAAG A^GCCCTGGA AAAlGGOCCGC 

601 CGTGGCGAAG CCGGCGACCT cACCGOAAAG CAAACCCACA AGCGCCAATG 

651 CTTCGTGGTT GCC6ACGCGG OCGA^^CACGA GOGCATCGAA AACGACATCC 

701 GCACCATGCC TGAWACTTC GTTGGCTACG AW5TCGAAGT CAACTTCATC 

751 GACGAAGCAA CCTTgGACQC CGAGCACACC GGCATGCCAC ACGGcGGaCA 

801 CGtGATcACC ACCGGCGACA CCGGTGGCTT CAACCACACC GT66AATACA 

851 TCCTgAAGCT GGACCGAAAC CCAGATTTCA CCGCTTCtrc ACAGATCGCT 

901 TTCGGcCGCG CAGCTCACXG CATGAAGCAG CAGGGCCAAA GCCGtGCTTT 

951 CACCGTCCTC GAAGTrGCTC CATACtTGCT CTCCCCgGAG AACTTGGAtG 

1001 ATCTGATCGC ACGCGAC6TC TAA 
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HTM t^VAlVGYCHl^eils X'CIC 



ISi ■» ..^^f^^ — » ■ ■ I 340 

OTKTVTrDVAOVftKIIAO DVS 

VLrLcn«8|iYQlpCQA» KTA 

30- ♦.._-._.^,„,^ ^ , 

o?AC7voTroifiiRor»ii!SRQ 

3^1 — ' ■■ I ■>^*-..--- ^ 1 ... . , I 120" 

«6i ^^^^^ ^ ^ ^ ^^ f' ^^^*^*^^ * ^* '^^ I iajj ccciitcocr 



S40 

KrrVCPGL$QCBSO. ALU ?. T p 

5«» — * , » ^ ^ 

C^QKAVQYTLpSEEALtS Aa 

RCe?. COLTGKQtRKllOC rVV 

ADAADHCntCHOIftTHei^Yr 

VC1fSVBV«riOCATl.Oii£KT 



OKraccHvitTCDTCcraB 



040 
T 



VEtlVKLORMPOrXASSQiK 



^1 
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qoi — ♦ 4 . ^— 4„ - X 

• • 960 

r G R A A H R M K o f:i C S C A r t V i. 

GAAGTTGCTCCATACtTGCTCTCCCCgCyiCAACTTGa^^ 

^. ^ 



EVA PY LLSPEHLDDLXAp. 



0 V 



TAA 

1021 1023 



n<j..ii> 
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SttQuttOCO o£ fall length ZvbA Cm JflRBL &-11474 (S^ XD OO: 
U] } OodBclined regims the pcinliig «it9 Cqk lysx prlaer 
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VSVBAn.SSftt.OTjKRSTIiDDIItaiiS^ 
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Huclootxde fiecjueacea ot AS019 lysA iSEQ ID MO^S) {j> 

1 ATCGCTACAG TTGJVAAATTT CAATOAACr? CCCGCACACG TATGGCCACG 
SI CAATGCCGTG CGCCAAGAAG ftCGGCjTTGT CACCGTOGCT GGTGTGCCTC 
101 TGCCTGACCT CGCTGAAGAA TACGGAACCC CZ-jCTGITCGT AGTCGACGAG 
ISl GAOGATTTQC GTTGCCOCTG TCCCGP^TG GCTACOGCliT TOGGTGGACC 
201 AGGCAATGTG CACtACGCAT CTAAiiSCGTT CCTGAOCAAd ACCATTGCAC 
2S1 GTTOGGnGA. TGIUU3»GGG6 CTGGCnCTGG ACATTGQOC GATdUyOGAA 
301 CtCGGCMTG GCCTGGCCGC tGGmOOOC GGCAOCCGTH tOCCGCGCh 
3SI CGGCAACAAC AAAGGCGXMS AGTTCCTGCG CG C GTT GO IT CZMAAOOGTC 
401 TCGGI^CA&GT GGMCTGGAC TCCGCMTAGG AACTAGMCT GTTG6ATTAC 
451 6TTGO0GCTG GTGIUU5GCM GftTTCkSGAC GTGTTGAICC GCGtAMGCC* 
501 AGGCA!T0i3AA GCACACACXIC AOCAGTTCM CSQCCACTAGC CACSGAA&ACC 
5SX AGAAGnboG ArTCTCCCTG GCATCCOGTT GOGCATTOGA A6CAGCAAAA 
601 GCOGOCAACA ACGCAGAAAA CCTGA.icCTG GTTGGCCTGC: AClGGCAlOGT 
651 TGGTTOCCAG GTGTTCGACG CCCIVAG:iCTT CAJ^GCXGOCA GCAGAACGCG 
701 TGTTGGQCCt GTACTCACAG ATCCftCAGCG AACTGGGGGT TCCCCTTCCT 
751 GAACTGGATC tCGGTGGCGG ATACX;3CATT GCCTATACCG CAGCTGAAGA 
801 ACCACTCAAC GTOGCAGA^JG TTGCC?CCGA CCTGCtCACC GCAGTCGGAA 
asi AAATGGCAGC GGAACTAGGC ATCX>AC<;CAC CAACCGTGCT TGrTGAGCCC 
SOI.. GGCCGOGCTA TGGCA6GCCC CTCCAQOGTG AOCAXCTACG AASTCGGCAC 
9S1 CACCAAAGAC GXCCACGTAG ACGACiACAA AACCCGCQGT TACATGGCCG 
1001 TGGAOGGAGG CATGTCCGAC AACATOIXSCC CAGCACtCTA CGGCTOCGAA 
lOSl TAGGACGCCC GGGTAGTAtC CCGCTTCGCC G^AGGAGACC CAGTAAGCAC 
1101 CCGCATCGT6 GGCTCCCACT GCGAATCCGG CGf.TATCCTG ATCAACGATG 
1151 AAATCTACCC ATCTGACATC ACCAGCGGCG ACTTCCTTGl: ACTCGCAGCC 
1201 ACCGGCGCAT ACTGCTACGC CATGAC-CTCC CGCTACAACG CCTtCACACG 
X251 GCCCGCCGTC GT6TCCGTCC GCGCTSGCAG CTCCCGCCTC ATGCtGCG<S£ 
130 X GCGAAACGCT C6ACGACATC CTCTdCTAG AQGCATAA 
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ATGCCPACAGXTCAWKAIrrCAATGAACTTCCCCCACACCTATG^^ 
I ♦ ♦ + • 60 

MATVeiirMCLPAHVHPRlCKV 

OWyUVGAAGACGGCGTTCTQlCX^ 
61 -H I ■ — + X20 

aOCDGVVTVAGVPLPDLAEE 

TACGGAACCCClUCTGTtOGTAGTCGAdGAG^ 
X21 ~ .. — lao 

YGT PIiFVYD'-EODFRSRCRDM 

GCTACOGSCATItlGGTGGAjXAOGCAATGTGG^ 
181 1 ■ I ■ .,^4..— — I «- I : 240 

ATA rCGPG H V UYASKAFLTK 
ACCATTCCACCnrrOGCTTGAIGftl^ 

TXARWVDBEGC^ALDIASZVE 
CTGGGCATTGCXX:TGGCCGCTG<rrTTCCCCX;^ 

LGXAXiAAG P^P ASRITAH GHH 

AAAGGCGtAGAGTTCCTGCGCGCGTTGCTTCAAAAa^ 
361 — -~ , — .^^.^..«.«.^4. 

KGV EFI.RAltVQHCVGHVV.LD 
TCCGCACA6GAI\Cn?AGAACn!GTtGGATTAC^ 

$AQEI.fiI.LDYVAAGEGKl~QD 

GTGTTGATCC»CCTAAAGCCACGCATCGAAGCACAC^^ 
481 .^--4-^ *■ — >■■ I — + — — 4^«— 540 

V LI RVKPGIEAHTHBFIA? S 

CACGAAGACCAGAAGTTCGGATTCrCX:CTGGCATCCGGTT^^ 
541 1—- + -+ — — — ♦ 600 

U BOQKFGF'SLASGSAFBAAK 
^ GCCGCCAACAACGCAGAAAACCTGAACO^GTTGGCXrr^ 

AAN NAENLNLVGLHCaVGSQ 
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I 

sea ^ '-'^^ Tp^^) 

^ CnxrrrCGACCCXCAAGGCTTCAAGCTGCCAGCAGAACGCGTCrrrG^ 

C6l. — 1 — — — ♦ -f-^ +^ ^ ^-jQ 

V fuaecfklaaervlclys q 

ATCCACAGCGAACTOGGCGTTGCCCTTCCTC/UVCTCC^ 

I USE1.GVALPBL.0LGGGYG !«. 

GCCTATAOOXyMSCTtaUUSAIVCCJ^^ 
781 — —I 1 ' 1 ^ ^ — I — + 840 

A YTAABEPL HVAEVAS D/.lTl. % 

GCAGTCGGAAAAMGGCAGCGGAACXAGGCATCGAOGCAi^^ 
841 4 I 4 -I : 1 — 8- 900 

AVGKUAAEIiGXDAe. TVLVB P 

GGCCGCGCXATOGCAGGCXOCTCCAOCCTGAO^^ 
901 (. 1 1 ——4 — — I—— 960 

CRAIA6 PSTVTIYEVGTTKD 

GTCXUVCGTAGACXACGACMUUVCCCGCC^ 
961 — h 4-— fr— .•4._»^ + 1020 

V HVD DDKTRaYlAVDGGHS D 

AACATCCGGCXAGCACtCTACGGCTCCGMTACGACGC^^ 
1021 4- 1 1 j.— ^ — . + « + X080 

N XRPALYGSEYDARVVSRFA 

GAAGOAOACCCAGTAAGCACCCGCATCGTGGGCTCCCACTdCC^^ 
108 1 — f —4 = — b ™+ 1140 

S CD P VSTRIVG SHCESGDI L 

ATCAiUIGATGAAATCTAaXavTCrGA^ 
X141 — — 4....—.^ — ♦ .-4 • — — 4, 1200 

X MDEXYPSDZYSGDrLALAA 

ACCGGCGCATACTGCIAOGOCAXGACC^ 
X201 ^..4.««.««,..-«^4^ 1 — f-— — ^4.-_^-«^ 1. 1260 

T GAYCYAMSSRYHAPTRPAV 

GT(^a:GTCCGCGCtGGCAGCTCCCGCCTCATGCtGaK^^ 
1261 + ^-^-4 4 —i ^.^i. — + 1320 

V SVRAGSSRLHLRRETLOD X 
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CTCTCACTACAGGCATAA 
132V 4 « 1338 

U 5 t E A « 



wo 01/49854 



22/33 



PCTArSOO/35617 



s^ss ^ssss SSS^ 

iSSciCGftA CTGCrrOCRA TGCtGOGlWr CtGCGCCMai TCMGGMGT 

^SSS StS^WG ftfiCMOOCIA reaMXKK TAT wcoaxs 

^SSS^SS gtS^KVCC CRIGRKttGS TCCGHOjOOS OCMtttTCC 

SJSSSS ?S?^a TOCwrac MOGoa^ 

SSSSr CTcSoMG JWCMCOGCA GrtOCet«IA CTTTTOVTGC 
SSSSSS tSScTCGG «OBCmCIl JOeiWCTGCT TAATGOGCta 

nS^OGMGR *COSN«G» CtTTTAAGGA lUUVMXAGGC 
^^^re ICcSoOCAA CGCTACftGAC TCCAOAKTCG 
SSSS SSS^ ATTTCIV™ CA«^ 
ISIISS oSSgTOGA AATCCGCGAA CTWSCGGTAe GRT6TTTAAG 
^SgOOC CRRCTGTTTT aSCTCATTTT GAIUTTGAAA 
SSSSSi ^MgSI AT6CCAACAA GCCCGTATGT CAtGGACtTT 



I 

51 
101 
151 
201 
251 
301 
351 

401 

4S1 

501 

551 

601 

651 

701 

751 TAA 
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GTGCCCGAACAACrrAAATT<y\GCGTGCACT 

IAaEQVkLSVEL I h C S S FTr P 

* 

CKrrGATCTTGAGTGGTCWUrrO^TGTTGAGGGCGCCGAAG^ 
61 — — ♦ ♦ * — ♦ 4- 120 

adve;hstovegaealvefag 
ogtgcctgctacgaaacttttcataaoccgaacc^ 

racyetfdkphpethsiiaay 



181 — ~i 1 — I I ,^4. 

I. RR I MEV6 RT. AI.LBHAIIA T M 

TATATOCGAGGCATTTCTCGGTGCGOGACCCft^^ 
241 — ♦ h-- 1 1 T— 1-* — + 300 

Y I RG ISRSATHELVRR EHTS 

* 

TTCrCTCAACTGTCTCAGCGTTTCGTGCACAGC^GACIUVTCGGJ^ 
301 + + ^ J. 4 ^ 3g(j 

FSQLSQRrvHSGBSEVVVpT 

CTCATCGATGAAGATGOGCAGTTGCGTGAACnTTCATGC^ 
361 r*.i.^..«.»..-«4...««.^«..— i'^^^, „4 — *«4. 420 

L X OfiDPQLRfiLrMHAHDESR 

TTCGCTTTCAATGAGCTGCTTAATGCGCTGGAAGAAA^^ 
421 ^..+ 1 ; 1 4— , r- — 1- 4lio 

rAFMELLRALCeKLGDEPtlA 

CTTTTAAGGAAAAAGCAGGCTCGTCAAGCAGCTCGCGCT^ 
481 +■ +™ — — +- f — + 540 

L I, rkkqahoaaravlpn at^'e 

TCCAGAATCGTGGTGTCTGGAAACTTCCGCACCTGGAGGO^ 
541 + ♦ — ♦ — — — -~-4 • — |. 1. 600 

SR ivvsghtrthrh figmra. 

AGTGAACATGCAGACGTCGAAATCCGOGAACTAGCGGTAGGATOT^ 
601 — — ~ i.^^— ......^^.w^ 1 4— ^^««---._+ 660 

S B HAOVEIRCVAVGCLRKLO 

GTAGCAGCGCXAACTGTrrTCGGTGATTTTG^ 

661 — — i ■ — i^^^. — ^. .•.«4. 720 

■ 

VAAPTVraOFEieTX.ADGSQ 

ATGGCAACAAGCGCGTATGTCATGGACtTTTAA 
■721 ~ ^ 1^ —4 .75i 



MATS PtVMDF 



4 
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'^•I'CC 13032 

ATCC 21529 
Consensu* 



ATCC 13032 
NJ13 

ATCC 21529 
Consensu^ 



ATCC 13032 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21S29 
Concensus 



ATCC 13032 
N13 

ATCC 21529 
Concensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 
N13 

ATCC 21529 
Consensus 



ATCC 13032 

ia3 

ATCC 21529 
Consensus 



HALWQKYCC SSLESAERIR KAGNcy/vVVC S^-XJOT 

51 



SO 



100 



LELAAAVMPV PfiAR] 
101 



CL TMSRISMAL VAKAtBSLGA EtOSPTCSQA 

150 



WLTTElUra, ABIVM^POR VREW^BcUcr CIVAOF^W/N KETRDVTTLO 
151 

200 

RGOSDTTAVA lAAAIKAIJVC EiysOVDGW TadWIV^kA QKLEKLSPEE 
201 

250 



MOELAAVGSK ILVLRSVEYA RAFNVPLRV?. SSYSMDPOTl, li^^ 
251 



-300 



EEAVLTCVAT DKSEAKVTVL GISOKKEAA KVPlUVLADAB tKIEMVLQNV 
301 



A 

A 



G 
0 



350 



SSVEDOTTDI TFTCPRADGR RA^rvv^CKL^ V^NWTNVLY K:2V^st.v 
351 

400 



OAG^aCSHPGV »;.EFMEM*D Vm,XELZ^ SSIRISVLIR EDDLDAAARA 
401 



421 



^-HCQFOLGQE D2AWYACTQ R 



fig..'} 
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OOGAGlTCATOQAITGGCnXJC^^ 

AAIAGTGCATCKK3CACCmMAaX5^^ 

CAACSAACTCCTACiaariT^ 
OTTAOCCCAAAAACKIAT^^ 
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1 
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sua: .^^ 
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jy <M SSSt -Eft 

1 CTGGCCGAAC AAGTTAAATT GAGCGTGGi^G TTGATACCCT GCAGTTCrTT 

51 TACTCCACCC GCTGATGTTC AGTGGTCAAC TGATGTTGAG GGCGCGGAAG 

lOX CACTCGTCGA GTTTGCOGGT CCTGCCTGCT ACGAAACTTT TGATMGCCC 

151 AWXXrtOyUl CIGCjnX»U\ TGCTGC^ 

201 GGG<XavCW:T GCTTTGCTTG JUSCAtGCCAA 1GCCM36MCG TM ATC06M 

251 GCATTTCTCG GTCCGCGACC CATGAATTGG IOOGACAOCG GCMTTtTOC 

301 TTCTCTCAAC TGTCrCAGCG TTTCCTGCftC l^flCGGAGAAT CGGMGCAGT 

351 GGTGCCCACT CTC^T* 
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Truncated 0RF2 amino acid sequence (SEQ ID NO: 19) 

GT6GC(»WyVAfiTrAAAnGAGCCTGGA6n6ATAfiCGTG^^ 
J + + + + + + 60 

MAEQVKLSVELIACSSFTPP 



61 



GCTGATGn-GAGnrefiTCAACTGATGTTGAGeOCGCGGAAfi^ 
+ + +..., + + + 120 

ADVEWSTDVEGAEALVEFA6 

CGreCCTeCTAC6/V\ACTmGATAAGCC6AACCXn^ 
121 + + + + + 

RACYETFDKPNPRTASNAAY 

CTeCOXADVTCATfiGAAfireGGGC^^ 
181 + •»• + — + 240 

LRHIMEV6HTALLEHANATM 



241 



TATATaXiAGQCAmCTCGGTaQC6AlXCAT6AATTGGra:6A(y\^^ 
^ + + + + + 300 

YIRGISRSATHELVRHRHFS 

TTCTCTCAACTGTCTCAQtmTCCTeCA(JVaQGAGA^^ 
301 + + + + + 

FSQLSQRFVHSGESEVVVPT 

CTCAT ... 
361 

L (I) 

FIG. 24 



RECTIF;£D SHEET (RULE 91) 

ISA/EP 
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a^TOPBSMiwimasiamBagaHB ^^ 
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-1- 



SEQOBNCE LISTING 



<110> Archer-Daniels-Midland Company 
Hanke, Paul D. 
Li-D'Elia, Lhing-Yew 
Rayapati/ John 

<120> Increased Lysine Production by Gene Anqplification 
<130> 1533.103PC03 

<150> OS 60/173,707 
<151> 1999-12-30 
<150> OS 60/184,130 
<151> 2000-02-22 

<160> 21 

<170> Patentin version 3.0 

<210> 1 

<211> 1266 

<212> DHA 

<213> ask 

<220> 

<221> CDS 

<222> (1)..(1266) 



<400> 1 

gtg gcc ctg gtc gta 
Met Ala Leu Val Val 
1 5 



cag aaa tat ggc ggt tec teg ctt gag agt gcg 
Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 

10 15 



48 



gtc get gaa egg ate gtt gcc ace aag aag get 
Val Ala Glu Arg He Val Ala Thr Lys Lys Ala 

25 30 



96 



gaa cgc att aga aac 
Glu Arg He Arg Asn 

20 
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-2- 

gga aat gat gtc gtg gtt gtc tgc tec gca atg gga gac acc acg gat 144 

Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 

35 40 45 

gaa ctt eta gaa ctt gca gcg gca gtg aat ccc gtt ccg cca get cgt 192 

Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 



gaa atg gat atg etc etg act get ggt gag cgt att tet aac get etc 
Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg He Ser Asn Ala Leu 
€5 70 75 80 

gtc gee atg get att gag tec ctt gge gca gaa get caa tct ttc act 
Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Phe Thr 

85 90 ' 95 



att gtt gac gtc aca ccg ggt cgt gtg cgt gaa gca etc gat gag gge 
He Val TVsp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
115 120 ' 125 



etg gaa aag etc age ttc gaa gaa atg etg gaa ctt get get gtt gge 
Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 

tec aag att ttg gtg etg cgc agt gtt gaa tac get cgt gca ttc aat 
Ser Lys He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 

gtg cca ctt cgc gta cgc teg tct tat agt aat gat ccc gge act ttg 
Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu 
225 230 235 240 

att gee gge tct atg gag gat att act gtg gaa gaa gca gtc ctt acc 
He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 

245 250 255 

ggt gtc gca acc gac aag tec gaa gcc aaa gta acc gtt etg ggt att 
Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly He 

260 265 270 

tec gat aag cca gge gag get gcc aag gtt ttc cgt gcg ttg get gat 
Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 



240 



288 



gge tet cag get ggt gtg etc ace acc gag cgc cac gga aac gca cgc 336 
Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg 

100 105 110 



384 



aag ate tgc att gtt get ggt ttt cag ggt gtt aat aaa gaa ace cgc 432 
Lys He Cys He Val Ala Gly Phe Gin Gly Val Asn Lys Glu Thr Arg 
130 135 140 



480 



528 



gat gtc acc acg ttg ggt cgt ggt ggt tet gac acc act gca gtt gcg 
Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 160 

ttg gca get get ttg aac get gat gtg tgt gag att tac teg gac gtt 
Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu He Tyr Ser Asp Val 

165 170 175 

gac ggt gtg tat ace get gac ccg cgc ate gtt cct aat gca cag aag 576 
Asp Glv Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 

180 185 190 



624 



672 



720 



7 68 



816 



864 
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gca gaa ate aac att gac atg gtt ctg cag aac gtc tec tet gtg gaa 912 
Ala Glu He Asn He Asp Met Val Leu Gin Asn Val Ser Ser Val Glu 
290 295 300 



gea gge aee gga cgc taa 
Ala Gly Thr Gly Arg 

420 



<210> 2 

<211> 421 

<212> PRT 

<213> ask 



<400> 2 

Met Ala Leu Val Val Gin Lys Tyr Gly Gly Ser Ser Leu Glu Ser Ala 
1 5 10 

Glu Arg He Arg Asn Val Ala Glu Arg He Val Ala Thr Lys Lys Ala 

• 20 25 30 

Gly Asn Asp Val Val Val Val Cys Ser Ala Met Gly Asp Thr Thr Asp 
35 40 45 

Glu Leu Leu Glu Leu Ala Ala Ala Val Asn Pro Val Pro Pro Ala Arg 
50 55 60 



960 



1008 



gac ggc acc acc gac ate acg ttc acc tgc cct cgc get gac gga cgc 
Asp Gly Thr Thr Asp He Thr Phe Thr Cys Pro Arg Ala Asp Gly Arg 
305 310 315 320 

cat qcg atg gag ate ttg aag aag ctt cag gtt cag ggc aac tgg acc 
Ara Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 

325 330 335 

aat gtg ctt tac gac gac cag gtc ggc aaa gtc tec etc gtg ggt get 1056 
Asn Val Leu Tyr Asp Asp. Gin Val Gly Lys Val Ser Leu Val Gly Ala 

340 345 350 

qqc atg aag tct cac cca ggt gtt ace gca gag ttc atg gaa get ctg 1104 
Glv Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 

cgc gat gtc aac gtg aac ate gaa ttg att tec ate tet gag ate cgc 1152 
Arg Asp Val Asn Val Asn He Glu Leu He Ser He Ser Glu He Arg 
370 375 380 

att tec gtg ctg ate cgt gaa gat gat ctg gat get get gca cgt gca 1200 
He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
385 390 395 400 

ttq cat gag cag ttc cag ctg ggc ggc gaa gac gaa gee gtc gtt tat 1248 
Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala Val Val Tyr 

405 410 415 



1266 
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Glu Met Asp Met Leu Leu Thr Ala Gly Glu Arg lie Ser Asn Ala Leu 

65 70 75 80 

Val Ala Met Ala He Glu Ser Leu Gly Ala Glu Ala Gin Ser Fhe Thr 

85 90 95 



Gly Ser Gin Ala Gly Val Leu Thr Thr Glu Arg His Gly Asn Ala Arg 

100 105 110 



lie Val Asp Val Thr Pro Gly Arg Val Arg Glu Ala Leu Asp Glu Gly 
115 120 125 

Lys He Cys lie Val Ala Gly Phe Gin Gly Val Asn Lys Glu Thr Arg 
130 135 140 

Asp Val Thr Thr Leu Gly Arg Gly Gly Ser Asp Thr Thr Ala Val Ala 
145 150 155 160 

Leu Ala Ala Ala Leu Asn Ala Asp Val Cys Glu lie Tyr Ser Asp Val 

165 170 175 



Asp Glv Val Tyr Thr Ala Asp Pro Arg He Val Pro Asn Ala Gin Lys 

180 185 190 

Leu Glu Lys Leu Ser Phe Glu Glu Met Leu Glu Leu Ala Ala Val Gly 
195 200 205 

Ser Lys He Leu Val Leu Arg Ser Val Glu Tyr Ala Arg Ala Phe Asn 
210 215 220 



Val Pro Leu Arg Val Arg Ser Ser Tyr Ser Asn Asp Pro Gly Thr Leu 
225 230 235 240 



He Ala Gly Ser Met Glu Asp He Pro Val Glu Glu Ala Val Leu Thr 

245 250 255 



Gly Val Ala Thr Asp Lys Ser Glu Ala Lys Val Thr Val Leu Gly He 

260 265 270 



Ser Asp Lys Pro Gly Glu Ala Ala Lys Val Phe Arg Ala Leu Ala Asp 
275 280 285 



Ala Glu He Asn He Asp Met Val Leu Gin Asn Val Ser Ser Val Glu 
290 295 300 



Asp Gly Thr Thr Asp He Thr Phe Thr Cys Pro Arg Ala Asp Gly Arg 
305 310 315 320 
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Ara Ala Met Glu He Leu Lys Lys Leu Gin Val Gin Gly Asn Trp Thr 
^ 325 330 335 

Asn Val Leu Tyr Asp Asp Gin Val Gly Lys Val Ser Leu Val Gly Ala 

340 345 350 

Glv Met Lys Ser His Pro Gly Val Thr Ala Glu Phe Met Glu Ala Leu 
355 360 365 

Ara Asp Val Asn Val Asn He Glu Leu He Ser He Ser Glu He Arg 
370 375 380 

He Ser Val Leu He Arg Glu Asp Asp Leu Asp Ala Ala Ala Arg Ala 
385 390 395 400 

* 

Leu His Glu Gin Phe Gin Leu Gly Gly Glu Asp Glu Ala -Val Val Tyr 

405 410 415 



Ala Gly Thr Gly Arg 

420 



<210> 3 

<211> 1035 

<212> DNA 

<213> asd 



<220> 

<221> CDS 

<222> (1) . . (1035) 



<400> 3 _^ 
atg acc acc ate gca gtt gtt ggt gca acc ggc cag gtc ggc cag gtt 
Met Thr Thr He Ala Val Val Gly Ala Thr Gly Gin Val Gly Gin Val 
15 10 15 

atg cgc acc ttt ttg gaa gag cgc aat ttc cca get gac act gtt cgt 
Met Arg Thr Phe Leu Glu Glu Arg Asn Phe Pro Ala Asp Thr Val Arg 

20 25 30 

ttc ttt get tec ceg cgt tec gca ggc cgt aag att gaa ttc cgt ggc 
Phe Phe Ala Ser Pro Arg Ser Ala Gly Arg Lys He Glu Phe Arg Gly 
35 40 45 

aeg gaa ate gag gta gaa gac att act cag gca acc gag gag tec etc 
Thr Glu He Glu Val Glu Asp He Thr Gin Ala Thr Glu Glu Ser Leu 
50 55 60 



48 



96 



144 



192 
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aaa aac ate gac gtt gcg ttg ttc tct get gga ggc ace get tec aag 
51 Gly ill Isp Val All Leu Phe Ser Ala Gly Gly Thr Ala Ser Lys 



240 



65 70 

caa tac act oca ctg ttt get get gea ggc gcg act gtt gtg gat aac 
Tyr S Pro Leu Phe Ala Ala Ala Gly Ala Thr Val Val Asp Asn 

85 90 

tet tct get tgg cgc aag gac gac gag gtt cca eta ate gtc tct gag 
ser Ser Ala Trp Arg Lys Asp Asp Glu Val Pro Leu He Val Ser Glu 

100 105 110 

gtg aac ect tec gac aag gat tec ctg gtc aag ggc att att gcg aat 
?al Asn Pro Ser Asp Lys Asp Ser Leu Val Lys Gly lie He Ala Asn 
115 120 125 

cct aac tqc acc ace atg get gea atg cca gtg ctg aag cca ctg cac 
iTo Sn cTs ?hr Thr Me? Ma Xla Met Pro Val Leu Lys Pro Leu Hxs 
130 135 

aat occ get ggt ett gta aag ett cac gtt tec tct tac cag get gtt 
Sa Ma lly Leu Val Lys Leu His Val Ser Ser Tyr Gin Ala Val 
145 150 155 

tec ggt tct ggt ett gea ggt gtg gaa ace ttg gea aag cag gtt get 
Ser Gly Ser Ily Leu Ala Gly Val Glu Thr Leu Ala Lys Gin Val Ala 

165 1''0 ^'^ 

oca gtt ggc gac cac aac gtt gag ttc gtc cat gat gga cag get get 
Ma val Gly Lp His Asn Val Glu Phe Val His Asp Gly Gin Ma Ma 

180 .185 

aac aca aac aat gtc gga cct tac gtt tec cca ate get tac aac gtg 
Sp Ma Gly S Val lly Pro Tyr Val Ser Pro He Ma Tyr Asn Val 
195 200 205 

eta cca ttc gcc gga aac etc gtc gat gac ggc acc ttc g^a acc gac 
Su ??o Phe Ma Gly Asn Leu Val Asp Asp Gly Thr Phe Glu Thr Asp 
210 215 ' 220 

aaa aaa caa aaa ctg cgc aac gaa tec cgc aag att etc ggc etc cca 
S« l!2 Sn lyl lei A?g Asn Glu Ser Arg Lys He Leu Gly Leu Pro 

225 230 235 



gac etc aag gtc tea ggc acc tgc gtc cgc gtg ccg gtt ttc acc ggc 
Lp Leu Lys Val Ser Gly Thr Cys Val Arg Val Pro Val Phe Thr Gly 

245 250 

cac acg ctg acc att cac gcc gaa ttc gac aag gea ate acc gtc gag 
His Thr Leu Thr He His Ala Glu Phe Asp Lys Ala He Thr Val Glu 

260 265 270 

cag gcg cag gag ate ttg ggt gcc get tea ggc gtc gag ett gtc gac 
Gin Ma Gin Glu lie Leu Gly Ala Ala Ser Gly Val Glu Leu Val Asp 
275 280 285 

gtc cca acc cca ett gea get gcc ggc att gac gaa tec etc gtt gga 
?al Pro Thr Pro Leu Ala Ala Ala Gly He Asp Glu Ser Leu Val Gly 
290 295 300 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 
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cgc ate cgt cag gac tec act gtc gac gac aac cgc ggt ctg gtt etc 
Acq He Arg Gin Asp Ser Thr Val Asp T^p Asn Arg Gly Leu Val Leu 
305 310 315 320 

gtc gta tct ggc gat aac ctt cgc aag ggc gca gca ctg aac ace att 
Val Val Ser Gly Asp Asn Leu Arg Lys Gly Ala Ala Leu Asn Thr He 

325 330 335 



<210> 4 

<211> 344 

<212> PRT 

<213> asd 



<400> 4 

Met Thr Thr He Ala Val Val Gly Ala Thr Gly Gin Val Gly Gin Val 
15 10 15 

Met Arg Thr Phe Leu Glu Glu Arg Asn Phe Pro Ala Asp Thr Val Arg 

20 25 30 

Phe Phe Ala Ser Pro Arg Ser Ala Gly Arg Lys lie Glu Phe Arg Gly 
35 40 45 

Thr Glu He Glu Val Glu Asp He Thr Gin Ala Thr Glu Glu Ser Leu 
50 . 55 60 

Lys Gly He Asp Val Ala Leu Phe Ser Ala Gly Gly Thr Ala Ser Lys 
65 70 75 80 

Gin Tyr Ala Pro Leu Phe Ala Ala Ala Gly Ala Thr Val Val Asp Asn 

85 90 95 

Ser Ser Ala Trp Arg Lys Asp Asp Glu Val Pro Leu He Val Ser Glu 

100 105 110 



Val Asn Pro Ser Asp Lys Asp Ser Leu Val Lys Gly He He Ala Asn 
115 120 125 



960 



1008 



cag att get gag ctg ctg gtt aag taa 1035 
Gin He Ala Glu Leu Leu Val Lys 

340 



Pro Asn Cys Thr Thr Met Ala Ala Met Pro Val Leu Lys Pro Leu His 
130 135 140 
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Asp Ala Ala Gly Leu Val Lys Leu His Val Ser Ser Tyr Gin Ala Val 
145 150 155 160 

Ser Gly Ser Gly Leu Ala Gly Val Glu Thr Leu Ala Lys Gin Val Ala 

165 170 175 

Ala Val Gly Asp His Asn Val Glu Phe Val His Asp Gly Gin Ala Ala 

180 185 190 

Asp Ala Gly Asp Val Gly Pro Tyr Val Ser Pro He Ala Tyr Asn Val 
195 200 205 



Leu Pro Phe Ala Gly Asn Leu Val Asp Asp Gly Thr Phe Glu Thr Asp 
210 215 220 

Glu Glu Gin Lys Leu Arg Asn Glu Ser Arg Lys He Leu Gly Leu Pro 
225 230 235 240 

Asp Leu Lys Val Ser Gly Thr Cys Val Arg Val Pro Val Phe Thr Gly 

245 250 255 



His Thr Leu Thr He His Ala Glu Phe Asp Lys Ala He Thr Val Glu 

260 265 270 

Gin Ala Gin Glu He Leu Gly Ala Ala Ser Gly Val Glu Leu Val Asp 
275 280 285 

Val Pro Thr Pro Leu Ala Ala Ala Gly He Asp Glu Ser Leu Val Gly 
290 295 300 

Ara He Arg Gin Asp Ser Thr Val Asp Asp Asn Arg Gly Leu Val Leu 
305 310 315 320 

Val Val Ser Giy Asp Asn Leu Arg Lys Gly Ala Ala Leu Asn Thr He 

325 330 335 



Gin He Ala Glu Leu Leu Val Lys 

340 



<210> 5 
<211> 906 



<212> DNA 
<213> dapA 
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<220> 

<221> CDS 

<222> (1)..(906) 



atragc^aca ggt tta aca get aag acc gga gta gag cac ttc ggc acc 
S? sir Thr Giy Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly Thr 

10 



1 5 



gat egg gcg aag etc ate gee ggt gtc gga acc aac aac acg egg aca 
Asp ftrl Ala Lys Leu He Ala Gly Val Gly Thr Asn Asn Thr Arg Thr 

85 ^0 

tet gtg gaa ctt gcg gaa get get get tct ^ct ggc gca gac ggc ctt 
Ser V^l Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly Leu 

100 105 HO 

tta gtt gta act cet tat tac tec aag ccg age caa gag gga ttg ctg 
Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gin Glu Gly Leu Leu 
115 120 125 

aca cac ttc ggt gca att get gca gca aca gag gtt eca att tgt etc 
SI nls Phe Gly La He Ila Ala Ala Thr Glu Val Pro He Cys Leu 
130 135 140 



tat gac att cet ggt egg tea ggt att oca att gaa tct gat acc atg 
T^r £p He Pro Gly Arg Ser Gly He Pro He Glu Ser Asp Thr Met 
145 150 155 



aaa cac ctg agt gaa tta cet acg att ttg gcg gte aag gac gee aag 
J?g ^g Leu Ser Glu Leu Pro Thr He Leu Ala Val Lys Asp Ala Lys 

165 I'^O ^'^ 

ggt gac etc gtt gca gcc acg tea ttg ate aaa gaa acg gga ctt gee 
Gly Lp Leu ^1 Ala Ala Thr Ser Leu He Lys Glu Thr Gly Leu Ala 

180 185 190 

tgg tat tea ggc gat gac eca eta aac ctt gtt tgg ctt get ttg ggc 
Trp Tyr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu Gly 
195 200 205 



48 



96 



144 



gtt gga gta gca atg gtt act cea ttc acg gaa tee gga gac ate gat 
Val Glv Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp He Asp 

20 25 30 

ate get get ggc ogc gaa gtc gcg get tat ttg gtt gat aag ggc ttg 
He Ala Ala Gly A^g Glu Val Ala Ala Tyr Leu Val Asp Lys Gly Leu 
35 40 45 

frat- tct tta att etc qcg age ace act ggt gaa tec eca acg aca ace 
III LeS val 2S 111 Gly Thr Thr Gly Glu Ser Pro Thr Thr Thr 
50 55 60 

gcc get gaa aaa eta gaa ctg etc aag gee gtt egt gag gaa gtt ggg 240 
ila Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val Gly 
65 70 "75 



192 



288 



336 



384 



432 



480 



528 



576 



624 
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qga tea got ttc att tec gta att gga cat gca gcc ccc aca gca tta 
Glv Ser Gly Phe He Ser Vai He Gly His Ala Ala Pro Thr Ala Leu 
210 215 220 

cot gag ttg tae aea age ttc gag gaa ggc gac etc gtc cgt gcg egg 
Aro Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala Arg 
225 230 235 240 

qaa ate aac gcc aaa eta tea ecg etg gta get gee eaa ggt ege ttg 
Glu He Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gin Gly Arg Leu 

245 - 250 255 

ggt gga gtc age ttg gca aaa get get etg cgt etg eag ggc ate aac 
Gly Gly Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gin Gly He Asn 

260 265 270 

gta gga gat cet cga ctt cca att atg get eea aat gag eag gaa ctt 
Val Gly Asp Pro Arg Leu Pro He Met Ala Pro Asn Glu Gin Glu Leu 
275 280 285 

gag get etc cga gaa gac atg aaa aaa get gga gtt eta taa 
Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu 
290 295 300 



<210> 6 

<211> 301 

<212> PRT 

<213> dapA 



<400> 6 

Met Ser Thr Gly Leu Thr Ala Lys Thr Gly Val Glu His Phe Gly Thr 
1 R 10 15 



Val Gly Val Ala Met Val Thr Pro Phe Thr Glu Ser Gly Asp He Asp 

20 25 30 

He Ala Ala Gly Arg Glu Val Ala Ala Tyr Leu Val Asp Lys Gly Leu 
35 40 45 

Asp Ser Leu Val Leu Ala Gly Thr Thr Gly Glu Ser Pro Thr Thr Thr 
50 55 60 

Ala Ala Glu Lys Leu Glu Leu Leu Lys Ala Val Arg Glu Glu Val Gly 
65 70 75 80 



672 



720 



768 



816 



864 



906 
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Ser Val Glu Leu Ala Glu Ala Ala Ala Ser Ala Gly Ala Asp Gly Leu 

100 105 110 

Leu Val Val Thr Pro Tyr Tyr Ser Lys Pro Ser Gin Glu Gly Leu Leu 
115 120 125 

Ala His Phe Gly Ala He Ala Ala Ala Thr Glu Val Pro He Cys Leu 
130 135 140 

Tvr Asp He Pro Gly Arg Ser Gly He Pro lie Glu Ser Asp Thr Met 
145 150 155 160 

Ara Arq Leu Ser Glu Leu Pro Thr He Leu Ala Val Lys Asp Ala Lys 
^ 165 "0 175 

Glv Asp Leu Val Ala Ala Thr Ser Leu He Lys Glu Thr Gly Leu Ala 

180 185 190 

Tro Tvr Ser Gly Asp Asp Pro Leu Asn Leu Val Trp Leu Ala Leu Gly 
^ 195 200 205 

Glv Ser Gly Phe He Ser Val He Gly His Ala Ala Pro Thr Ala Leu 
^ 210 215 220 

Ara Glu Leu Tyr Thr Ser Phe Glu Glu Gly Asp Leu Val Arg Ala Arg 
225 230 235 240 

Glu He Asn Ala Lys Leu Ser Pro Leu Val Ala Ala Gin Gly Arg Leu 

245 250 . 255 

Glv Glv Val Ser Leu Ala Lys Ala Ala Leu Arg Leu Gin Gly He Asn 

260 265 270 

Val Gly Asp Pro Arg Leu Pro He Met Ala Pro Asn Glu Gin Glu Leu 
275 280 285 



Glu Ala Leu Arg Glu Asp Met Lys Lys Ala Gly Val Leu 
290 295 300 



<210> 7 

<211> 747 

<212> DNA 

<213> BFIOO dapB 
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<220> 

<221> CDS 

<222> (1)..(747) 



<400> 7 ^ 4. 

atg gga ate aag gtt ggc gtt etc gga gcc aaa ggc cgt gtt.ggt caa 
Met Gly He Lys Val Gly Val Leu Gly Ala Lys Gly Arg Val Gly Gin 
1 5 10 15 

act att gtg gca gca gtc aat gag tec gac gat ctg gag ctt gtt gca 
Thr He Val Ala Ala Val Asn Glu Ser lVsp Asp Leu Glu Leu Val Ala 

20 25 30 

gag ate ggc gtc gac gat gat ttg age ctt ctg gta gac aac ggc get 
Glu He Gly Val Asp Asp Asp Leu Sex Leu Leu Val Asp Asn Gly Ala 
35 40 45 

gaa gtt gtc gtt gac ttc acc act cct aac get gtg atg ggc aac ctg 
Glu Val Val Val Asp Phe Thr Thr Pro Asn Ala Val Met Gly Asn Leu 
50 55 . 60 • 

aag ttc tgc ate aac aac ggc att tct gcg gtt gtt gga acc acg ggc 
Glu Phe Cys He Asn Asn Gly He Ser Ala Val Val Gly Thr Thr Gly 
65 70 75 80 

ttc gat aat get cgt ttg gag cag gtt cgc gcc tgg ctt gaa gga aaa 
Phe Asp Asn Ala Arg Leu Glu Gin Val Arg Ala Trp Leu Glu Gly Lys 

85 90 95 

gac aat gtc ggt gtt ctg ate gca cct aac ttt get ate tct gcg gtg 
Asp Asn Val Gly Val Leu He Ala Pro Asn Phe Ala He Ser Ala Val 

100 105 110 

ttg acc atg gtc ttt tee aag cag get gcc cgc ttc ttc gaa tea get 
Leu Thr Met Val Phe Ser Lys Gin Ala Ala Arg Phe Phe Glu Ser Ala 
115 120 125 

gaa gtt att gag ctg cac cac cce aac aag ctg gat gca cct tea ggc 
Glu Val He Glu Leu His His Pro Asn Lys Leu Asp Ala Pro Ser Gly 
130 135 140 

ace gcg ate cac act get cag ggc att get gcg gea cgc aaa gaa gca 
Thr Ala He His Thr Ala Gin Gly He Ala Ala Ala Arg Lys Glu Ala 
145 150 155 160 

ggc atg gac gca cag eca gat gcg ace gag cag gca ctt gag ggt tec 
Gly Met Asp Ala Gin Pro Asp Ala Thr Glu Gin Ala Leu Glu Gly Ser 

165 170 175 

egt ggc gca age gta gat gga ate cca gtt cac gca gtc cgc atg tec 
Arg Gly Ala Ser Val Asp Gly He Pro Val His Ala Val Arg .Met Ser 
. 180 185 190 

ggc atg gtt get cac gag caa gtt ate ttt ggc acc cag ggt cag ace 
Gly Met Val Ala His Glu Gin Val He Phe Gly Thr Gin Gly Gin Thr 
195 200 205 

ttg acc ate aag cag gac tec tat gat cgc aac tea ttt gca cca ggt 
Leu Thr He Lys Gin Asp Ser Tyr Asp Arg Asn Ser Phe Ala Pro Gly 
210 215 220 



4« 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 
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gtc ttg gtg ggt gtg cgc aac att gca cag cac cca ggc eta gtc gta ^ 720 

Val Leu Val Gly Val Arg Asn He Ala Gin His Pro Gly Leu Val Val 
225 230 235 240 

gga ctt gag cat tac eta ggc ctg taa 747 
Gly Leu Glu His Tyr Leu Gly Leu 

245 



<210> 8 

<211> 248 

<212> PRT 

<213> BFIOO dapB 



<400> 8 

Met Gly He Lys Val Gly Val Leu Gly Ala Lys Gly Arg Val Gly Gin 
15 10 15 

Thr He Val Ala Ala Val Asn Glu Ser Asp Asp Leu Glu Leu Val Ala 

20 25 30 

Glu He Gly Val Asp Asp Asp Leu Ser Leu Leu Val Asp Asn Gly Ala 
35 40 45 

Glu Val Val Val Asp Phe Thr Thr Pro Asn Ala Val Met Gly Asn Leu 
50 55 60 

Glu Phe Cys He Asn Asn Gly He Ser Ala Val Val Gly Thr Thr Gly 
65 70 75 80 

Phe Asp Asn Ala Arg Leu Glu Gin Val Arg Ala Trp Leu Glu Gly Lys 

85 90 95 

Asp Asn Val Gly Val Leu He Ma Pro Asn Phe Ala He Ser Ala Val 

100 105 110 

Leu Thr Met Val Phe Ser Lys Gin Ala Ala Arg Phe Phe Glu Ser Ala 
115 120 125 

Glu Val He Glu Leu His His Pro Asn Lys Leu Asp Ala Pro Ser Gly 
130 135 140 

Thr Ala He His Thr Ala Gin Gly He Ala Ala Ala Arg Lys Glu Ala 
145 150 155 160 

Glv Met Asp Ala Gin Pro Asp Ala Thr Glu Gin Ala Leu Glu Gly Ser 
^ 165 170 175 
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Ara Gly Ala Ser Val Asp Gly He Pro Val His Ala Val Arg Met Ser 
^ 180 185 

Gly Met Val Ala His Glu Gin Val He Phe Gly Thr Gin Gly Gin Thr 
195 200 205 

hen Thr He Lys Gin Asp Ser Tyr Asp Arg Asn Ser Phe Ala Pro Gly 
210 215 220 

val Leu Val Gly Val Arg Asn He Ala Gin His Pro Gly Leu Val Val 
225 230 235 ^4U 



Gly Leu Glu His Tyr Leu Gly Leu 

245 



<210> 9 

<211> 1023 

<212> DNA 

<213> BF 100 ddh 



<220> 

<221> CDS 

<222> (1)-.(1023) 



ato^cat^ttc aqt aag etc gac cag gac agt gcc acc aca att ttg gag 
£? lis Phe IVy Ss Leu Lp Gin Asp Ser Ala Thr Thr He Leu Glu 
15 10 15 

aat tac aag aac atg acc aac ate egc gta get ate gta ggc tac gga 
Sp ^yr 5s Me? Thr Asn He Arg Val Ala He Val Gly Tyr Gly 

20 25 30 

aac ctQ qqa cgc age gtc gaa aag ctt att gcc aag cag ccc gac atg 
III IZ A?g ser ^1 Glu Lys Leu lie Ala Lys Gin Pro Asp Met 
35 40 

gac ctt gta gga ate ttc teg cgc egg gcc acc etc gac aca adg acg 
Lp Leu Val Gly He Phe Ser Arg Arg Ala Thr Leu Asp Thr Lys Thr 
50 55 60 

cca gtc ttt gat gtc gcc gac gtg gac aag cac gcc gac gac gtg gac 
Pro Val Phe Asp Val Ala Asp Val Asp Lys His Ala Asp Asp Val Asp 
65 70 ^^ 8° 

qtg ctg ttc ctg tgc atg ggc tec gcc acc gac ate cct gag cag gca 
val Leu Phe Leu Cys Met Gly Ser Ala Thr Asp He Pro Glu Gin Ala 

85 90 95 



48 



96 



144 



192 



240 



288 
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cca aag ttc gcg cag ttc gcc tgc acc gta gac acc tac gac aac cac 336 
Pro Lys Phe Ala Gin Phe Ala Cys Thr Val Asp Thr Tyr Asp Asn Hxs 

100 105 110 



cgc gac ate cca cgc cac cgc cag gtc atg aac gaa gcc gcc acc gca 
Ara Asp He Pro Arg His Arg Gin Val Met Asn Glu Ala Ala Thr Ala 
^ 115 120 125 

qcc ggc aac gtt gca ctg gtc tct acc ggc tgg gat cca gga atg ttc 
Ala Gly Asn Val Ala Leu Val Ser Thr Gly Trp Asp Pro Gly Met Phe 
130 135 140 

tec ate aac cgc gtc tac gca gcg gca gtc tta gcc gag cac cag cag 
Ser He Asn Arg Val Tyr Ala Ala Ala Val Leu Ala Glu Hxs Gin Gin 
145 150 155 160 

cac acc ttc tgg ggc cca ggt ttg tea cag ggc cac tec gat get ttg 
His Thr Phe Trp Gly Pro Gly Leu Ser Gin Gly His Ser Asp Ala Leu 

165 170 175 

cga cgc ate cct ggc gtt caa aag gcc gtc cag tac acc etc cca tec 
Ara Ara He Pro Gly Val Gin Lys Ala Val Gin Tyr Thr Leu Pro Ser 

180 185 190 

gaa gaa gcc ctg gaa aag gcc cgc egt ggc gaa gcc ggc gae etc ace 
Glu Glu Ala Leu Glu Lys Ala Arg Arg Gly Glu Ala Gly Asp Leu Thr 
195 200 205 

gga aag caa acc cac aag cgc caa tgc ttc gtg gtt gcc gac gcg gcc 
Gly Lys Gin Thr His Lys Arg Gin Cys Phe Val Val Ala Asp Ala Ala 
210 215 220 

gac cac gag cgc ate gaa aac gac ate cgc acc atg cct gat tac ttc 
Asp His Glu Arg He Glu Asn Asp He Arg Thr Met Pro Asp Tyr Phe 
225 230 235 240 

gtt ggc tac gaa gtc gaa gtc aac ttc ate gac gaa gca acc ttg gac 
Val Glv Tyr Glu Val Glu Val Asn Phe He Asp Glu Ala Thr Leu Asp 
' •' 245 250 255 

gcc gag cac ace ggc atg cca cac ggc gga cac gtg ate acc acc ggc 
Ala Glu His Thr Gly Met Pro His Gly Gly His Val He Thr Thr Gly 

260 265. 270 

gac acc ggt ggc ttc aac cac acc gtg gaa tac ate ctg aag ctg gae 
Asp Thr Gly Gly Phe Asn His Thr Val Glu Tyr He Leu Lys Leu Asp 
275 280 285 

cga aac cca gat ttc acc get tct tea cag ate get ttc ggc cgc gca 
pig Asn Pro Asp Phe Thr Ala Ser Ser Gin ^le Ala Phe Gly Arg Ala 
290 295 300 

get cac cgc atg aag cag cag ggc caa age ggt get ttc acc gtc etc 
Ala His Arg Met Lys Gin Gin Gly Gin Ser Gly Ala Phe Thr Val Leu 
305 310 315 320 

gaa gtt get cca tac ttg etc tec ccg gag aac ttg gat gat ctg ate 
Glu Val Ala Pro Tyr Leu Leu Ser Pro Glu Asn Leu Asp Asp Leu lie 

325 330 335 

gca cgc gac gtc taa 
Ala Arg Asp Val 

340 



384 



432 



480 



528 



576 



624 



672 



. 720 



768 



816 



864 



912 



960 



1008 



1023 
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10 


<211> 


340 


<212> 


PRT 


<213> 


BF 


<400> 


10 



Met His Phe Gly Lys Leu Asp Gin Asp Ser Ala Thr Thr He Leu Glu 
15 10 15 

Asp Tyr Lys Asn Met Thr Asn He Arg Val Ala He Val Gly Tyr Gly 

20 25 30 



Asn Leu Gly Arg Ser Val Glu Lys Leu He Ala Lys Gin Pro Asp Met 
35 40 45 



Asp Leu Val Gly He Phe Ser Arg Arg Ala Thr Leu Asp Thr Lys Thr 
50 55 60 



Pro Val Phe Asp Val Ala Asp Val Asp Lys His Ala Asp Asp Val Asp 
65 70 '75 80 



Val Leu Phe Leu Cys Met Gly Ser Ala Thr Asp He Pro Glu Gin Ala 

85 .90 95 



Pro Lys Phe Ala Gin Phe Ala Cys Thr Val Asp Thr Tyr Asp Asn His 

100 105 110 



Arg Asp He Pro Arg His Arg Gin Val Met Asn Glu Ala ALac Thr Ala 
115 120 125 



Ala Gly Asn Val Ala Leu Val Ser Thr Gly Trp Asp Pro Gly Met Phe 
130 135 140 



Ser He Asn Arg Val Tyr Ala Ala Ala Val Leu Ala Glu His Gin Gin 
145 150 155 160 



His Thr Phe Trp Gly Pro Gly Leu Ser Gin Gly His Ser Asp Ala Leu 

165 170 175 



Arg Arg He Pro Gly Val Gin Lys Ala Val Gin Tyr Thr Leu Pro Ser 

180 185 190 



Glu Glu Ala Leu Glu Lys Ala Arg Arg Gly Glu Ala Gly Asp Leu Thr 
195 200 205 
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Glv Lys Gin Thr His Lys Arg Gin Cys Phe Val Val Ala Asp Ala Ala 
^ 210 215 220 

Asp His Glu Arg He Glu Asn Asp He Arg Thr Met Pro Asp Tyr Phe 
225 230 235 240 

Val Gly Tyr Glu Val Glu Val Asn Phe He Asp Glu Ala Thr Leu Asp 

245 250 255 

laa Glu His Thr Gly Met Pro His Gly Gly His Val He Thr Thr Gly 

260 265 270 

Asp Thr Gly Gly Phe Asn His Thr Val Glu Tyr He Leu Lys Leu Asp 
275 280 285 

Arg Asn Pro Asp Phe Thr Ala Ser Ser Gin He Ala Phe Gly Arg Ala 
290 295 300 

Ala His Arg Met Lys Gin Gin Gly Gin Ser Gly Ala Phe Thr Val Leu 
305 310 315 

Glu val Ala Pro Tyr Leu Leu Ser Pro Glu Asn Leu Asp Asp Leu He 

325 330 335 



Ala Arg Asp Val 

340 



<210> 11 

<211> 1338 

<212> DNA 

<213> full length LysA 



<220> 

<221> CDS 

<222> (1)..(1338) 



<400> 11 *- 
atg get aca gtt gaa aat ttc aat gaa ctt ccc gca cac gta tgg cca 
Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 
15 10 15 

cgc aat gca gtg cgc caa gaa gac ggc gtt gtc acc gtc get ggt gtg 
Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 
''20 .25 30 



48 



96 
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cct ctg cct gac etc get gaa gaa tac gga ace cca ctg ttc gta gtc 144 
Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 



gac gag gac gat ttc cgt tec cgc tgt cgc gac atg get acc gca ttc 
Asp Glu Asp Asp Phe Arg Ser Arg Cys Arg Asp Met Ala Thr Ala Phe 
50 55 60 

* 

ggt gga cca ggc aat gtg cac tac gca tee aaa gcg ttc ctg acc aag 
Gly Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 
65 70 75 80 

acc att gca cgt tgg gtt gat gaa gag ggg ctg gca ctg gac att gcg 
Thr lie Ala Arg Trp Val Asp Glu Glu Gly Leu Ala Leu Asp lie Ala 

85 90 95 



ttg gaa ctg ctg gat tac gtt gcc get ggt gaa ggc aag ate cag gac 
Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys He Gin Asp 
145 150 155 160 

gtg ttg ate cgc gtg aag cca ggt ate gaa gee cac acc cac gag ttc 
Val Leu He Arg Val Lys Pro Gly He Glu Ala His Thr His Glu Phe 

165 170 175 

ate gcc act age cac gaa gac cag aag ttc gga ttc tec ctg gca tee 
He Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 

180 185 190. 

ggt tec gca ttc gaa gca gcg aaa gca gee aac aat gca gag aac ttg 
Gly Ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 
195 200 205 

aac ctg gtt ggt ctg cac tgc cat gtt ggt tec cag gtg ttc gac gee 
Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Phe Asp Ala 
210 215 220 

gaa ggc ttc aag ctg gca gca gag cgc gtg ttg ggc ctg tac tea cag 
Glu Gly Phe Lys Leu Ala Ala Glu Arg Val Leu Gly Leu Tyr Ser Gin 
225 230 235 240 

ate cac age gaa eta ggt gtc gcc ctt cct gag ctg gac etc ggt ggc 
He His Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 

245 250 255 

gga tac ggc ate gee tac act gca gat gag gaa cca etc aac gtc gca 
Glv Tyr Gly He Ala Tyr Thr Ala Asp Glu Glu Pro Leu Asn Val Ala 

260 265 270 

gaa gtc gee tec gac eta etc acc gca gtc gga aaa atg gca gcg gaa 
Glu Val Ala Ser Asp Leu Leu Thr Ala Val Gly Lys Met Ala Ala Glu 
275 280 285 



192 



24*0 



288 



tec -ate aat gaa ctg ggc att gcc ctg gcc get ggt ttc ccg gcc age 336 
Ser He Asn Glu Leu Gly He Ala Leu Ala Ala Gly Phe Pro Ala Ser 

100 105 110 

cgt ate ace gcg cac ggc aac aac aaa ggc gta gag ttc ctg cgc gcg 384 
Arg He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 
115 120 125 

ttg gtt caa aac ggt gtc ggg eat gtg gtg ctg gac tec gcg cag gaa 432 
Leu Val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 
130 135 140 



480 



528 



576 



624 



672 



720 



768 



816 



864 
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cta ggc ate gac gca cca acc gtg ctt gtt gag ccc ggc cgc get ate 912 
Leu Gly He Asp Ala Pro Thr Val Leu Val Glu Pro Gly Arg Ala He 
290 295 300 

aca- ggc ccc tec acc gtg acc ate tac gaa gtc ggc acc acc aaa aac 
Ala Gly Pro Ser Thr Val Thr He Tyr Glu Val Gly Thr Thr Lys Asn 
305 310 315 320 



gte cac gta gac gac gae aaa acc cgc egc tae gta gee gtc gae gga 
Val His Val Asp Asp Asp Lys Thr Arg Arg Tyr Val Ala Val Asp Gly 

. 325 330 335 

ggc atg tec gac aac ate cgc cca gca etc tac ggc tec gaa tae gae 
Gly Met Ser Asp Asn He Arg Pro Ala Leu Tyr Gly Ser Glu Tyr Asp 

340 345 350 

gee cgc gta gta tec cgc ttc gee gaa gga gac cca gta age acc cgc 
Ala Aig Val Val Ser Arg Phe Ala Glu Gly Asp Pro Val Ser Thr Arg 
355 360 • 365 

ate gtg ggc tee cac tgc gaa tec ggc gat ate ctg ate aac gat gaa 
He Val Gly Ser His Cys Glu Ser Gly Asp He Leu He Asn Asp Glu 
370 375 380 



ate tac cca tct gac ate acc- age ggc gae ttc etc gca etc gca gee 
He Tvr Pro Ser Asp He Thr Ser Gly Asp Phe Leu Ala Leu Ala Ala 
385 390 395 400 

acc ggc gca tac tgc tac gee atg age tee egc tae aac gee ttc aca 
Thr Gly Ala Tyr Cys Tyr Ala Met Ser Ser Arg Tyr Asn Ala Phe Thr 

405 410 415 

egg ccc gee gte gtg tec gte cgc get ggc age tec cgc etc atg ctg 
P^Q Pro Ala Val Val Ser Val Arg Ala Gly Ser Ser Arg Leu Met Leu 
^ 420 425 430 

cgc egc gaa acc etc gae gae ate etc tea eta gag gca taa 
Arq Arg Glu Thr Leu Asp Asp He Leu Ser Leu Glu Ala 
"435 440 445 

<210> 12 

<211> 445 

<212> PRT 

<213> full length LysA 



<400> 12 

Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 

«i 10 15 



Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 

20 25 30 

Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1338 
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ASP Glu A3P A3P Phe Arg Ser Arg Cys Arg Asp Met Ala Thr Ala Phe 
50 55 60 

Gly Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 
65 ''O 

Thr He Ala Arg Trp Val Asp Glu Glu Gly Leu Ala Leu Asp lie Ala 



Ser 



He Asn Glu Leu Gly He Ala Leu Ala Ala Gly Phe Pro Ala Ser 



100 



105 



110 



Arg 



He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 



115 



120 



125 



Leu 



val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 



130 



135 



140 



Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys He Gin Asp 
145 150 



Val Leu He Arg Val Lys Pro Gly He Glu Ala His Thr His Glu Phe 



165 



170 



He 



Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 



180 



185 



190 



Gly ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 



195 



200 



Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Phe Asp Ala 

215 2^0 



210 



Glu Gly Phe Lys Leu Ala Ala Glu Arg Val Leu Gly Leu Tyr Ser Gin 



225 



230 



235 



He His 



is Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 



245 



250 



Gly Tyr Gly He Ala Tyr Thr Ala Asp Glu Glu Pro Leu Asn Val Ala 



260 



265 



Glu val Ala Ser Asp Leu Leu Thr Ala Val Gly Lys Met Ala Ala Glu 

280 285 



275 



Leu Gly He Asp Ala Pro Thr Val Leu Val Glu Pro Gly Arg Ala He 

295 



290 
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Ala Gly Pro Ser Thr Val Thr He Tyr Glu Val Gly Thr Thr Lys Asn 
305 310 315 32Q 

Val His Val Asp Asp Asp Lys Thr Arg Arg Tyr Val Ala Val Asp Gly 

325 330 335 

Gly Met Ser Asp Asn lie Arg Pro Ala Leu Tyr Gly Ser Glu Tyr Asp 

340 345 350 

Ala Arg Val Val Ser Arg Phe Ala Glu Gly Asp Pro Val Ser Thr Arg 
355 360 365 

He Val Gly Ser His Cys Glu Ser Gly Asp He Leu He Asn Asp Glu 
370 375 380 

He Tyr Pro Ser Asp He Thr Ser Gly Asp Phe Leu Ala Leu Ala Ala 
385 390 395 400 

Thr Gly Ala Tyr Cys Tyr Ala Met Ser Ser Arg Tyr Asn Ala Phe Thr 

405 410 415 

Arg Pro Ala Val Val Ser Val Arg Ala Gly Ser Ser Arg Leu Met Leu 
^ 420 425 430 

Arq Arg Glu Thr Leu Asp Asp He Leu Ser Leu Glu Ala 
435 440 445 



<210> 13 

<2li> 1338 

<212> DMA 

<213> AS019 lysA (pRS6) 



<220> 

<221> CDS 

<222> (1),.(1338) 



<400> 13 ^ 
atg get aca gtt gaa aat ttc aat gaa ctt ccc gca cac gta tgg cca 
Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 
15 10 15 

cgc aat gcc gtg cgc caa gaa gac ggc gtt gtc acc gtc get ggt gtg 
Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 

20 25 30 



48 



96 



wo 01/49854 



PCTAJS00y35617 



-22- 



cct ctg cct gac etc get gaa gaa tac gga acc cca ctg ttc gta gtc 144 
Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 

gac gag gac gat ttc cgt tec cgc tgt cgc gac atg get ace gca ttc 192 
Asp Glu Asp Asp Phe Arg Ser Arg Cys Arg Asp Met Ala Thr Ala Phe 
50 55 60 . 



got gga cca ggc aat gtg cac tac gca tet aaa geg tte ctg aee aag 
Gly Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 
65 70 75 80 

acc att gca cgt tgg gtt gat gaa gag ggg ctg gca ctg gac att gea 
Thr He Ala Arg Trp Val Asp Glu Glu Gly Leu Ala Leu Asp He Ala 

85 90 95 



cgt ate ace geg eae gge aae aac aaa ggc gta gag ttc ctg cgc gcg 
Ara He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 
115 120 125 

ttg gtt caa aae ggt gtg gga cae gtg gtg ctg gac tec gea cag gaa 
Leu Val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 
130 135 140 

eta gaa ctg ttg gat tac gtt gee get ggt gaa gge aag att cag gac 
Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys He Gin Asp 
145 150 155 160 

atq ttg ate ege gta aag eea gge ate gaa gea cae acc cae gag ttc 
Val Leu He Arg Val Lys Pro Gly He Glu Ala His Thr His Glu Phe 

165 170 175 



gga tac ggc att gee tat acc gca get gaa gaa cca etc aac. gtc gca 
Glv Tvr Gly He Ala Tyr Thr Ala Ala Glu Glu Pro Leu Asn Val Ala 

260 265 270 

gaa gtt gee tec gac ctg etc acc gca gtc gga aaa atg gea gcg gaa 
Glu Val Ala Ser Asp Leu Leu Thr Ala Val Gly Lys Met Ala Ala Glu 
275 280 285 



240 



288 



tec ate aac gaa ctg ggc att gcc ctg gcc get ggt ttc ccc gcc age 336 
Ser He Asn Glu Leu Gly He Ala Leu Ala Ala Gly Phe Pro Ala Ser 

100 105 . 110 



384 



432 



480 



528 



ate gee act age eae gaa gac cag aag tte gga ttc tec ctg gca tec 576 
He Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 

180 185 190 



672 



ggt tec gca ttc gaa gca gea aaa gee gcc aac aae gca gaa aac ctg 624 
Gly Ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 
195 200 205 

aac ctg gtt ggc ctg cae tge cae gtt ggt tee cag gtg tte gac gcc 
Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Phe Asp Ala 
210 215 220 

gaa gge ttc aag ctg gca gca gaa cgc gtg ttg gge ctg tac tea cag 
Glu Gly Phe Lys Leu Ala Ala Glu Arg Val Leu Gly Leu Tyr Ser Gin 
225 230 235 240 

ate cac age gaa ctg ggc gtt gcc ctt cct gaa ctg gat etc ggt ggc 768 
He His Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 

245 250 255 



720 



816 



864 
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cta ggc ate gac gca cca acc gtg ctt gtt gag ccc ggc cgc get ate 
Leu Gly He Lp Ala Pro Thr Val Leu Val Glu Pro Gly Arg Ala He 

^2 

nna aac CCC tcc acc otc acc ate tac gaa gtc ggc acc acc aaa gac 
IJa Ily Pro Ser ?hr 5^1 Thr He Tyr Glu Val Gly Thr Thr Lys Asp 

305 310 315 

gtc cac gta gac gac gac aaa acc cgc cgt tac ate gee gtg gac gga 
V^l His ?al Lp Lp Asp Lys Thr Arg Arg Tyr He Ala Val Asp Gly 

325 330 -^"^ 

« 

aac ata tcc qac aac ate ego cca gca etc tac ggc tcc gaa tac gac 
Sy hII ser Sp Asn He A?g Pro Ala Leu Tyr Gly Ser Glu Tyr Asp 

gee cgc gta gta tcc cgc ttc gcc gaa gga gac cca gta age acc cgc 
La Arg Val Val Ser Arg Phe Ala Glu Gly Asp Pro Val Ser Thr Arg 
35 5 

ate gtg ggc tcc cac tgc gaa tcc ggc gat ate ctg ate aac gat gaa 
Se val Gly Ser His C?8 Glu Ser Gly Asp He Leu He Asn Asp Glu 



370 375 



ate tac cca tct gac ate ace age ggc gac ttc ctt gca etc gca gcc 
III ITr Lp He Thr Ser Gly Asp Phe Leu Ala Leu Ala Ala 

385 390 

acc ggc gca tac tgc tac gcc atg age tcc cgc tac aac gcc ttc aca 
Thr Ily Ala Tyr Cys Tyr Ala Met Ser Ser Arg Tyr Asn Ala Phe Thr 

405 410 '^^^ 



caa ccc acc otc gtg tcc gtc cgc get ggc age tcc cgc etc atg ctg 
Sg Pro Sa Vel Sal Ser ^1 Arg Ala Gly Ser Ser Arg Leu Met Leu 



420 430 

cgc cgc gaa acg etc gac gac ate etc tea eta gag gca taa 
Glu 
435 



cgc cgc gaa acg ccc gac g^u auu, -™ - 

Arg Arg Glu Thr Leu Asp Asp He Leu Ser Leu Glu Ala 
^ ^ 440 445 



<210> 14 

<211> 445 

<212> PRT 

<213> AS019 lysA (pRS6) 



<400> 14 

Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 

Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 

20 25 30 

Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 



912 



960 



1008 



1056 



1104 



1152 



1200 



1248 



1296 



1338 
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Asp Glu Asp Asp Phe Arg Ser Arg Cys Arg Asp Met Ala Thx Ala Phe 

55 60 



50 



Gly Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 



65 



70 



75 



Thr lie Ala Arg Trp Val Asp Glu Glu Gly Leu Ala Leu Asp lie Ala 

90 95 



85 



Ser He Asn Glu Leu Gly He Ala Leu Ala Ala Gly Phe Pro Ala Ser 

100 105 110 



Arg He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 



115 



120 



Leu Val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 
130 135 140 

Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys He Gin Asp 
145 150 155 . 160 

val Leu He Arg Val Lys Pro Gly He Glu Ala His Thr His Glu Phe 

165 "0 1'5 

He Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 

180 185 190 



Gly Ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 

200 205 



195 



Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Phe Asp Ala 

215 220 



210 



Glu Gly Phe Lys Leu Ala Ala Glu Arg Val Leu Gly Leu Tyr Ser Gin 



225 



230 



235 



He His Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 

245 250 255 



Gly Tyr Gly He Ala Tyr Thr Ala Ala Glu Glu Pro Leu Asn Val Ala 



260 



265 



Glu Val Ala Ser Asp Leu Leu Thr Ala Val Gly Lys Met Ala Ala Glu 

280 285 



275 
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Leu Gly lie Asp Ala Pro Thr Val Leu Val Glu Pro Gly Arg Ala lie 
290 295 300 

Ala Gly Pro Ser Thr Val Thr He Tyr Glu Val Gly Thr Thr Lys Asp 
305 310 315 320 

« 

I 

Val His Val Asp Asp Asp Lys Thr Arg Arg *yr He Ala Val Asp Gly 

325 330 335 

Glv Met Ser Asp Asn He Arg Pro Ala Leu Tyr Gly Ser Glu Tyr Asp 

340 345 350 

Ala Arg Val Val Ser Arg Phe Ala Glu Gly Asp Pro Val Ser Thr Arg 
355 360 365 

He Val Gly Ser His Cys Glu Ser Gly Asp He Leu He Asn Asp Glu 
370 375 380 

He Tvr Pro Ser Asp He Thr Ser Gly Asp Phe Leu Ala Leu Ala Ala 
385 390 395 400 

Thr Glv Ala Tyr Cys Tyr Ala Met Ser Ser Arg Tyr Asn Ala Phe Thr 
' 405 410 

Ara Pro Ala Val Val Ser Val Arg Ala Gly Ser Ser Arg Leu Met Leu 

420 425 430 

Arg Arg Glu Thr Leu Asp Asp He Leu Ser Leu Glu Ala 
^ 435 440 445 



<210> 15 

<211> 753 

<212> DNA 

<213> or£2 in dapBA operon 



<220> 

<221> CDS 

<222> (1)..(753) 



<400> 15 4. 4. 

gtg gcc gaa caa gtt aaa ttg age gtg gag ttg ata gcg tgc agt tct 
Met Ala Glu Gin Val Lys Leu Ser Val Glu Leu He Ala Cys Ser Ser . 
1 5 10 15 



48 
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ttt act cca ccc get gat gtt gag tgg tea act gat gtt gag ggc gcg 
Phe Thr Pro Pro Ala Asp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 

20 25 30 

gaa gca etc gtc gag ttt gcg ggt cgt gee tgc tac gaa act ttt gat 
Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 
35 40 *^ 

aag ccg aac cct cga act get tec aat get gcg tat etg cgc cac ate 
Lys Pro Asn Pro Arg Thr Ala Ser Asn Ala Ala Tyr Leu Arg His lie 
50 55 60 

atg gaa gtg ggg eae act get ttg ett gag cat gee aat gee acg atg 
Met Glu Val Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 
65 70 75 80 

tat ate cga gge att tct egg tec gcg ace cat gaa ttg gtc cga cac 
Tvr He Arg Gly He Ser Arg Ser Ala Thr His Glu Leu Val Arg Hxs 
' 85 90 95 

cgc cat ttt tec ttc tct caa etg tct cag cgt tte gtg cac age gga 
Arg His Phe Ser Phe Ser Gin Leu Ser Gin Arg Phe Val His Ser Gly 

100 105 110 

gaa teg gaa gta gtg gtg cec act etc ate gat gaa gat ccg cag ttg 
Glu Ser Glu Val Val Val Pro Thr Leu He Asp Glu Asp Pro Gin Leu 
115 120 125 

cgt gaa ett ttc atg cac gee atg gat gag tct egg tte get tte aat 
Aig Glu Leu Phe Met His Ala Met Asp Glu Ser Arg Phe Ala Phe Asn 
130 . 135 140 

gag etg ett aat gcg etg gaa gaa aaa ett gge gat gaa ccg aat gca 
Glu Leu Leu Asn Ala Leu Glu Glu Lys Leu Gly Asp Glu Pro Asn Ala 
145 150 155 160 

ett tta agg aaa aag cag get cgt caa gca get cgc get gtg etg ccc 
Leu Leu Arg Lys Lys Gin Ala Arg Gin Ala Ala Arg Ala Val Leu Pro 

165 170 175 

aac get aca gag tee aga ate gtg gtg tet gga aac ttc cgc ace tgg 
Asn Ala Thr Glu Ser Arg He Val Val Ser Gly Asn Phe Arg Thr Trp 

180 185 190 

agg eat ttc att ggc atg cga gee agt gaa cat gca gac gtc gaa ate 
Arg His Phe He Gly Met Arg Ala Ser Glu His Ala Asp Val Glu He 
195 200 205 



cgc gaa gta gcg gta gga tgt tta aga aag etg cag gta gca gcg cca 
Sg Glu val Ala Val Gly Cys Leu Arg Lys Leu Gin Val Ala Ala Pro 
210 215 220 

act gtt tte ggt gat ttt gag att gaa act ttg gca gae gga teg caa 
Thr Val Phe Gly Asp Phe Glu He Glu Thr Leu Ala Asp Gly Ser Gin 
225 230 235 ^'^^ 



atg gca aca age ccg tat gtc atg gac ttt taa 
Met Ala Thr Ser Pro Tyr Val Met Asp Phe 

245 250 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 



753 
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<210> 


16 


<211> 


250 


<212> 


PRT 


<213> 


orf2 


<400> 


16 



Met Ala Glu Gin Val Lys Leu Ser Val Glu Leu lie Ala Cys Ser Ser 
1 5 10 15 

Phe Thr Pro Pro Ala Asp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 

20 25 30 



Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 
35 40 45 

Lys Pro Asn Pro Arg Thr Ala Ser Asn Pila Ala Tyr Leu Arg His He 
50 55 60 

Met Glu Val Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 
65 70 75 80 

Tvr He Ara Gly He Ser Arg Ser Ala Thr His Glu Leu Val Arg His 
^85 90 95 



Arg His Phe Ser Phe Ser Gin Leu Ser Gin Arg Phe Val His Ser Gly 

100 105 110 



Glu Ser Glu Val Val Val Pro Thr Leu He Asp Glu Asp Pro Gin Leu 
115 120 125 

Arg Glu Leu Phe Met His Ala Met Asp Glu Ser Arg Phe Ala Phe Asn 
130 135 140 

Glu Leu Leu T^n Ala Leu Glu Glu Lys Leu Gly Asp Glu Pro Asn Ala 
145 150 155 160 

Leu Leu Arg Lys Lys Gin Ala Arg Gin Ala Ala Arg Ala Val Leu Pro 

165 170 175 



Asn Ala Thr GIU Ser Arg He Val Val Ser Gly Asn Phe Arg Thr Trp 

180 185 190 



Arg His Phe He Gly Met Arg Ala Ser Glu His Ala Asp Val Glu He 
195 200 205 
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Arg Glu Val Ala Val Gly Cys Leu Arg Lys Leu Gin Val Ala Ala Pro 
210 215 220 

Thr Val Phe Gly Asp Phe Glu He Glu Thr Leu Ala Asp Gly Ser Gin 
225 230 235 240 

Met Ala Thr Ser Pro Tyr Val Met Asp Phe 

245 250 



<210> 17 

<211> 551 

<212> DNA 

<213> Hpal-PvuII fragment comprising the PI promoter 



60 
120 
180 
240 
300 
360 



<400> 17 ^ . . ^. . . 

aaccggtgtg gagccgacca ttccgcgagg ctgcactgca acgaggtcgt agttttggta 

catggcttct ggccagttca tggattggct gccgaagaag ctataggcat cgccaccagg 
gccaccggag ttaccgaaga tggtgccgtg cttttcgcct tgggcaggga ccttgacaaa 
gcccacgctg atatcgccaa gtgagggatc agaatagtgc atgggcacgt cgatgctgcc 
acattgagcg gaggcaatat ctacctgagg tgggcattct tcccagcgga tgttttcttg 
cgctgctgca gtgggcattg ataccaaaaa ggggctaagc gcagtcgagg cggcaagaac 
tgctactacc ttttttattg tcgaacgggg cattacggct ccaaggacgt ttgttttctg 420 
ggtcagttac cccaaaaagc atatacagag accaatgatt tttcattaaa aaggcaggga 
tttgttataa gtatgggtcg tattctgtgc gacgggtgta cctcggctag aatttctccc 540 

551 

catgacacca g 
<210> 18 

« 

<211> 365 
<212> DNA 

<213> truncated 0RF2 



480 



<220> 
<221> 
<222> 



CDS 

(1)..(365) 
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rta^acc^qaa caa gtt aaa ttg age gtg gag ttg ata gcg tgc agt tct 
S? fll Glu fin Wal Lys Leu Ser Val Glu Leu He Ala Cys Ser Ser 
1 5 . 10 " 

ttt act cca ccc get gat gtt gag tgg tea act gat gtt gag ggc gcg 
Phe ?hr Pro Pro Ala Lp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 

20 25 J*' • 

gaa gca etc gtc gag ttt gcg ggt cgt gcc tgc tac gaa act ttt gat 
Glu Ila Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 
35 40 45 

aag ccg aac cct cga act get tec aat get gcg tat ctg cge cac ate 
Lys ?r? Asn Pro Arg Thr Ala Ser Asn Ala Ala Tyr Leu Arg Hxs He 
' 50 55 60 

ata gaa gtg ggg cac act get ttg ett gag cat gcc aat gee acg atg 
Set Si Sal Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 
65 ■'O '5 

tat ate cga ggc att tct egg tec gcg ace cat gaa ttg gtc cga cac 
ijr ?le Sg Gly He Ser Arg Ser Ala Thr His Glu Leu Val Arg H« 

85 

cge eat ttt tee ttc tct caa ctg tct cag cgt ttc gtg cac age gga 

Ser 
100 



48 



cge cat ttt tec ttc tct caa ctg tci: cag ^yu 

Sg His Phe ser Phe Ser Gin Leu Ser Gin iVrg Phe Val Hxs Ser Gly 



gaa teg gaa gta gtg gtg ccc act etc at 
Glu Ser Glu Val Val Val Pro Thr Leu lie 
115 



<210> 19 

<211> 122 

<212> PRT 

<213> truncated 0BF2 



M^rLa^Glu Gin Val Lys Leu Ser Val Glu Leu He Ala Cys Ser Ser 
1 5 10 

Phe Thr Pro Pro Ala Asp Val Glu Trp Ser Thr Asp Val Glu Gly Ala 

20 25 30 

Glu Ala Leu Val Glu Phe Ala Gly Arg Ala Cys Tyr Glu Thr Phe Asp 
35 40 45 

Lys Pro Asn Pro Arg Thr Ala Ser Asn Ala Ala Tyr Leu Arg His He 
50 55 60 

Met Glu val Gly His Thr Ala Leu Leu Glu His Ala Asn Ala Thr Met 
65 10 ''^ 



96 



144 



192 



240 



288 



336 



365 
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Tyr He Arg Gly He Ser Arg Ser Ala Thr His Glu Leu Val Arg His 

85 90 95 



Afg His Phe Ser Phe Ser Gin Leu Ser Gin Arg Phe Val His Ser Gly 

100 105 110 



Glu Ser Glu Val Val Val Pro Thr Leu He 
115 120 



<210> 20 

<211> 833 

<212> DNA 

<213> truncated LysA ('LysA) 



<220> 

<221> CDS 

<222> (1)..(833) 



<400> 20 

atg get aca gtt gaa aat ttc aat gaa ctt ccc gca cac gta tgg cca 

Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 
15 10 15 

cgc aat gca gtg cgc caa gaa gac ggc gtt gtc acc gtc get ggt gtg 

Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 

20 25 30 



gac gag gac gat ttc cgt tec cgc tgt cgc gac atg get acc gca ttc 
Asp Glu Asp Asp Phe Arg Ser Arg Cys Arg Asp Met Ala Thr Ala Phe 
50 55 60 

ggt gga cca ggc aat gtg cac tac gca tec aaa gcg ttc ctg acc aag 
Glv Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 
65 70 75 80 

acc att gca cgt tgg gtt gat gaa gag ggg ctg gca ctg gac att gcg 
Thr lie Ala Arg Trp Val Asp Glu Glu Gly Leu Ala Leu Asp He Ala 

85 90 95 



cgt ate ace gcg cac ggc aac aac aaa ggc gta gag ttc ctg cgc gcg 
Arg He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 
115 120 ' 125 



48 



96 



cct ctg cct gac etc get gaa gaa tac gga acc cca ctg ttc gta gtc 144 
Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 



192 



240 



288 



tee ate aat gaa ctg ggc att gee ctg gee get ggt ttc ccg gee age 336 
Ser He Asn Glu Leu Gly He Ala Leu Ala Ala Gly Phe Pro Ala Ser 

100 105 110 



384 
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ttg gtt caa aac ggt gtc ggg cat gtg gtg ctg gac tec gcg cag gaa 432 
Leu Val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 
130 135 140 



ttg gaa ctg ctg gat tac gtt gcc get ggt gaa ggc aag ate eag gac 
Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys lie Gin Asp 
145 150 155 160 

gtg ttg ate cgc gtg aag cca ggt ate gaa gcc cac ace cac gag ttc 
Val Leu lie Arg Val Lys Pro Gly lie Glu Ala His Thr His Glu Phe 

165 170 175 



gaa ggc ttc aag ctg gea gca gag cgc gtg ttg ggc ctg tac tea cag 
Glu Gly Phe Lys Leu Ala Ala Glu Arg Val Leu Gly Leu Tyr Ser Gin 
225 230 235 240 

ate cac age gaa eta ggt gtc gcc ctt ect gag ctg gac etc ggt ggc 
lie His Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 

245 250 255 

gga tac ggc ate gcc tac act gca gat gag gaa cca etc aac gtc gca 
Glv Tyr Gly He Ala Tyr Thr Ala Asp Glu Glu Pro Leu Asn Val Ala 

260 265 270 

gaa gtc gee tee gac ct 
Glu Val Ala Ser Asp Leu 
275 



<210> 21 

<211> 278 

<212> PRT 

<213> truncated LysA ('LysA) 

<400> 21 

Met Ala Thr Val Glu Asn Phe Asn Glu Leu Pro Ala His Val Trp Pro 
1 5 10 15 

Arg Asn Ala Val Arg Gin Glu Asp Gly Val Val Thr Val Ala Gly Val 

20 25 30 



480 



528 



ate gcc act age cac gaa gac cag aag ttc gga ttc tec ctg gca tec 576 
He Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 

180 185 190 



624 



ggt tec gea ttc gaa gca gcg aaa gca gcc aac aat gca gag aac ttg 
Gly Ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 
195 200 205 

aac ctg gtt ggt ctg cac tge cat gtt ggt tec cag gtg ttc gac gcc 672 
Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Phe Asp Ala 
210 215 220 



720 



768 



816 



833 



Pro Leu Pro Asp Leu Ala Glu Glu Tyr Gly Thr Pro Leu Phe Val Val 
35 40 45 
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Asp Glu Asp Asp Phe Arg Ser Arg Cys Arg Asp Met Ala Thr Ala Phe 
50 55 60 



Gly Gly Pro Gly Asn Val His Tyr Ala Ser Lys Ala Phe Leu Thr Lys 
65 70 75 80 



Thr lie Ala Arg Trp Val Asp Glu Glu Gly Ldu Ala Leu Asp lie Ala 

85 90 95 



Ser lie Asn Glu Leu Gly lie Ala Leu Ala Ala Gly Phe Pro Ala Ser 

100 105 110 



Arg He Thr Ala His Gly Asn Asn Lys Gly Val Glu Phe Leu Arg Ala 
115 120 125 



Leu Val Gin Asn Gly Val Gly His Val Val Leu Asp Ser Ala Gin Glu 
130 135 140 



Leu Glu Leu Leu Asp Tyr Val Ala Ala Gly Glu Gly Lys He Gin Asp 
145 150 155 160 



Val Leu He Arg Val Lys Pro Gly He Glu Ala His Thr His Glu Phe 

165 170 175 



He Ala Thr Ser His Glu Asp Gin Lys Phe Gly Phe Ser Leu Ala Ser 

180 185 190 



Gly Ser Ala Phe Glu Ala Ala Lys Ala Ala Asn Asn Ala Glu Asn Leu 
195 200 205 



Asn Leu Val Gly Leu His Cys His Val Gly Ser Gin Val Piie Asp Ala 
210 215 220 



Glu Gly Phe Lys Leu Ala 7U.a Glu Arg Val Leu Gly Leu Tyr Ser Gin 
225 230 235 240 



He His Ser Glu Leu Gly Val Ala Leu Pro Glu Leu Asp Leu Gly Gly 

245 250 255 



Gly Tyr Gly He Ala Tyr Thr Ala Asp Glu Glu Pro Leu Asn Val Ala 

260 265 270 



Glu Val Ala Ser Asp Leu 
275 



